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(57) ABSTRACT 

Analytical reagents and mass spectrometry -based methods 
using these reagents for the rapid, and quantitative analysis 
of proteins or protein function in mixtures of proteins. The 
methods employ affinity labeled protein reactive reagents 
having three portions: an affinity label (A) covalently linked 
to a protein reactive group (PRG) through a linker group (L). 
The linker may be differentially iso topically labeled, e.g., by 
substitution of one or more atoms in the linker with a stable 
isotope thereof. These reagents allow for the selective iso- 
lation of peptide fragments or the products of reaction with 
a given protein (e.g., products of enzymatic reaction) from 
complex mixtures. The isolated peptide fragments or reac- 
tion products are characteristic of the presence of a protein 
or the presence of a protein function in those mixtures. 
Isolated peptides or reaction products are characterized by 
mass spectrometric (MS) techniques. The reagents also 
provide for differential isotopic labeling of the isolated 
peptides or reaction products which facilitates quantitative 
determination by mass spectrometry of the relative amounts 
of proteins in different samples. The methods of this inven- 
tion can be used for qualitative and quantitative analysis of 
global protein expression profiles in cells and tissues, to 
screen for and identify proteins whose expression level in 
cells, tissue or biological fluids is affected by a stimulus or 
by a change in condition or state of the cell, tissue or 
organism from which the sample originated. 

32 Claims, 9 Drawing Sheets 
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RAPID QUANTITATIVE ANALYSIS OF 
PROTEINS OR PROTEIN FUNCTION IN 
COMPLEX MIXTURES 

CROSS-REFERENCE TO RELATED 5 
APPLICATIONS 

This application is a division of U.S. patent application 
Ser. No. 09/388,062, filed Aug. 25, 1999, now U.S. Pat. No. 
6,670,194 which takes priority under 35 U.S.C. §119(e) 
from U.S. provisional application Ser. No. 60/097,788, filed 10 
Aug. 25, 1998, and Ser. No. 60/099,113, filed Sep. 3, 1998, 
all of which are incorporated in their entirety by reference 
herein. 

This invention was made through funding from the 
National Science Foundation Science and Technology Cen- 
ter for Molecular Biotechnology (grants 5T32HG and 
BIR9214821) and the National Institutes of Health (NIH 
grants RR11823, T32HG00035, HD-02274 and GM60184). 
The United States government has certain rights in this 2Q 
invention. 

BACKGROUND OF THE INVENTION 

Genomic technology has advanced to a point at which, in 
principle, it has become possible to determine complete 25 
genomic sequences and to quantitatively measure the mRNA 
levels for each gene expressed in a cell. For some species the 
complete genomic sequence has now been determined, and 
for one strain of the yeast Saccharomyces cervisiae, the 
mRNA levels for each expressed gene have been precisely 30 
quantified under different growth conditions (Velculescu et 
al., 1997). Comparative cDNA array analysis and related 
technologies have been used to determine induced changes 
in gene expression at the mRNA level by concurrently 
monitoring the expression level of a large number of genes 35 
(in some cases all the genes) expressed by the investigated 
cell or tissue (Shalon et al., 1996). Furthermore, biological 
and computational techniques have been used to correlate 
specific function with gene sequences. The interpretation of 
the data obtained by these techniques in the context of the 40 
structure, control and mechanism of biological systems has 
been recognized as a considerable challenge. In particular, it 
has been extremely difficult to explain the mechanism of 
biological processes by genomic analysis alone. 

Proteins are essential for the control and execution of 45 
virtually every biological process. The rate of synthesis and 
the half-life of proteins and thus their expression level are 
also controlled post-transcriptionally. Furthermore, the 
activity of proteins is frequently modulated by post- 
translational modifications, in particular protein 50 
phosphorylation, and dependent on the association of the 
protein with other molecules including DNA and proteins. 
Neither the level of expression nor the state of activity of 
proteins is therefore directly apparent from the gene 
sequence or even the expression level of the corresponding 55 
mRNA transcript. It is therefore essential that a complete 
description of a biological system include measurements 
that indicate the identity, quantity and the state of activity of 
the proteins which constitute the system. The large-scale 
(ultimately global) analysis of proteins expressed in a cell or go 
tissue has been termed proteome analysis (Pennington et al., 
1997). 

At present no protein analytical technology approaches 
the throughput and level of automation of genomic technol- 
ogy. The most common implementation of proteome analy- 65 
sis is based on the separation of complex protein samples 
most commonly by two-dimensional gel electrophoresis 
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(2DE) and the subsequent sequential identification of the 
separated protein species (Ducret et al., 1998; Garrels et al., 
1997; Link et al., 1997; Shevchenko et al., 1996; Gygi et al. 
1999; Boucherie et al., 1996). This approach has been 
revolutionized by the development of powerful mass spec- 
trometric techniques and the development of computer algo- 
rithms which correlate protein and peptide mass spectral 
data with sequence databases and thus rapidly and conclu- 
sively identify proteins (Eng et al., 1994; Mann and Wilm, 
1994; Yates et al., 1995). This technology has reached a level 
of sensitivity which now permits the identification of essen- 
tially any protein which is detectable by conventional pro- 
tein staining methods including silver staining (Figeys and 
Aebersold, 1998; Figeys et al., 1996; Figeys et al., 1997; 
Shevchenko et al., 1996). However, the sequential manner in 
which samples are processed limits the sample throughput, 
the most sensitive methods have been difficult to automate 
and low abundance proteins, such as regulatory proteins, 
escape detection without prior enrichment, thus effectively 
limiting the dynamic range of the technique. In the 2DE/ 
(MS)" method, proteins are quantified by densitometry of 
stained spots in the 2DE gels. 

The development of methods and instrumentation for 
automated, data-dependent electrospray ionization (ESI) 
tandem mass spectrometry (MS") in conjunction with micro- 
capillary liquid chromatography (*/LC) and database search- 
ing has significantly increased the sensitivity and speed of 
the identification of gel-separated proteins. As an alternative 
to the 2DE/MS" approach to proteome analysis, the direct 
analysis by tandem mass spectrometry of peptide mixtures 
generated by the digestion of complex protein mixtures has 
been proposed (Dongr'e et al., 1997).//LC-MS/MS has also 
been used successfully for the large-scale identification of 
individual proteins directly from mixtures without gel elec- 
trophoretic separation (Link et al., 1999; Opitek et al., 1997) 
While these approaches dramatically accelerate protein 
identification, the quantities of the analyzed proteins cannot 
be easily determined, and these methods have not been 
shown to substantially alleviate the dynamic range problem 
also encountered by the 2DE/MS/MS approach. Therefore, 
low abundance proteins in complex samples are also difficult 
to analyze by the //LC/MS/MS method without their prior 
enrichment. 

It is therefore apparent that current technologies, while 
suitable to identify the components of protein mixtures, are 
neither capable of measuring the quantity nor the state of 
activity of the protein in a mixture. Even evolutionary 
improvements of the current approaches are unlikely to 
advance their performance sufficiently to make routine 
quantitative and functional proteome analysis a reality. 

This invention provides methods and reagents that can be 
employed in proteome analysis which overcome the limita- 
tions inherent in traditional techniques. The basic approach 
described can be employed for the quantitative analysis of 
protein expression in complex samples (such as cells, 
tissues, and fractions thereof), the detection and quantitation 
of specific proteins in complex samples, and the quantitative 
measurement of specific enzymatic activities in complex 
samples. 

In this regard, a multitude of analytical techniques are 
presently available for clinical and diagnostic assays which 
detect the presence, absence, deficiency or excess of a 
protein or protein function associable with a normal or 
disease state. While these techniques are quite sensitive, 
they do not necessarily provide chemical speciation of 
products and may, as a result, be difficult to use for assaying 
several proteins or enzymes simultaneously in a single 
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sample. Current methods may not distinguish among aber- covalently linked to a protein reactive group (PRG) through 

rant expression of different enzymes or their malfunctions a linker group (L): 

which lead to a common set of clinical symptoms. The 

methods and reagents herein can be employed in clinical and A-L-PRG 

diagnostic assays for simultaneous (multiplex) monitoring 5 

of multiple proteins and protein reactions. The linker ma Y be differentially isotopically labeled, e.g., 

by substitution of one or more atoms in the linker with a 

SUMMARY OF THE INVENTION stable isotope thereof For example? hydrogens can be 

This invention provides analytical reagents and mass substituted with deuteriums or 12 C with 13 C. 

spectrometry-based methods using these reagents for the 1Q The affinity label A functions as a molecular handle that 

rapid, and quantitative analysis of proteins or protein func- selectively binds covalently or non-covalently, to a capture 

tion in mixtures of proteins. The analytical method can be reagent (CR). Binding to CR facilitates isolation of peptides, 

used for qualitative and particularly for quantitative analysis substrates or reaction products tagged or labeled with A. In 

of global protein expression profiles in cells and tissues, i.e. specific embodiments, A is a strepavidin or avidin. After 

the quantitative analysis of proteomes. The method can also 5 affinity isolation of affinity tagged materials, some of which 

be employed to screen for and identify proteins whose may be isotopically labeled, the interaction between A and 

expression level in cells, tissue or biological fluids is the capture reagent is disrupted or broken to allow MS 

affected by a stimulus (e.g., administration of a drug or analysis of the isolated materials. The affinity label may be 

contact with a potentially toxic material), by a change in displaced from the capture reagent by addition of displacing 

environment (e.g., nutrient level, temperature, passage of 2Q ligand, which may be free A or a derivative of A, or by 

time) or by a change in condition or cell state (e.g., disease changing solvent (e.g., solvent type or pH) or temperature 

state, malignancy, site-directed mutation, gene knockouts) conditions or the linker may be cleaved chemically, 

of the cell, tissue or organism from which the sample enzymatically, thermally or photochemically to release the 

originated. The proteins identified in such a screen can isolated materials for MS analysis. 

function as markers for the changed state. For example, 25 Two types of PRG groups are specifically provided 

comparisons of protein expression profiles of normal and herein: (a) those groups that selectively react with a protein 

malignant cells can result in the identification of proteins functional group to form a covalent or non-covalent bond 

whose presence or absence is characteristic and diagnostic tagging the protein at specific sites, and (b) those that are 

of the malignancy. transformed by action of the protein, e.g., that are substrates 

In an exemplary embodiment, the methods herein can be 30 for an enzyme. In specific embodiments, PRG is a group 

employed to screen for changes in the expression or state of having specific reactivity for certain protein groups, such as 

enzymatic activity of specific proteins. These changes may specificity for sulfhydryl groups, and is useful in general for 

be induced by a variety of chemicals, including pharmaceu- selectively tagging proteins in complex mixtures. A sulfhy- 

tical agonists or antagonists, or potentially harmful or toxic dryl specific reagent tags proteins containing cysteine. In 

materials. The knowledge of such changes may be useful for 35 other specific embodiments, PRG is an enzyme substrate 

diagnosing enzyme-based diseases and for investigating that is selectively cleaved (leaving A-L) or modified (giving 

complex regulatory networks in cells. A-L-PRG') by the action of an enzyme of interest. 

The methods herein can also be used to implement a Exemplary reagents have the general formula: 
variety of clinical and diagnostic analyses to detect the 

J ° J t A-B X (CH *) TX (CH *) 1 X (CH ) X B 

presence, absence, deficiency or excess of a given protein or 40 pRG k 2jn v 2j K 2jp 

protein function in a biological fluid (e.g., blood), or in cells 

or tissue. The method is particularly useful in the analysis of where: 

complex mixtures of proteins, i.e., those containing 5 or a is the affinity label; 

more distinct proteins or protein functions. PRG is the protein reactive group; 

The inventive method employs affinity-labeled protein 45 X 1 , X 2 , X 3 and X 4 , independently of one another, and X 2 

reactive reagents that allow for the selective isolation of independently of other X 2 in the linker group, can be 

peptide fragments or the products of reaction with a given selected from O, S, NH, NR, NRR ,+ , CO, COO, COS, S— S, 

protein (e.g., products of enzymatic reaction) from complex SO, S0 2 , CO — NR', CS — NR', Si — O, aryl or diaryl groups 

mixtures. The isolated peptide fragments or reaction prod- or X 2 -X 4 may be absent, but preferably at least one of 

ucts are characteristic of the presence of a protein or the 50 X 1 — X 4 is present; 

presence of a protein function, e.g., an enzymatic activity, B 1 and B 2 , independently of one another, are optional 

respectively, in those mixtures. Isolated peptides or reaction moieties that can faciliate bonding of the A or PRG group to 

products are characterized by mass spectrometric (MS) the linker or prevent undesired cleavage of those groups 

techniques. In particular, the sequence of isolated peptides from the linker and can be selected, for example, from COO, 

can be determined using tandem MS (MS") techniques, and 55 CO, CO — NR', CS — NR' and may contain one or more CH 2 

by application of sequence database searching techniques, groups alone or in combination with other groups, e.g. 

the protein from which the sequenced peptide originated can (CH 2 ) q — CONR', (CU 2 ) q — CS — NR', or (CH 2 ) q ; 

be identified. The reagents also provide for differential n, m, p and q are whole numbers that can have values from 

isotopic labeling of the isolated peptides or reaction products 0 to about 100, preferably one of n, m, p or q is not 0 and 

which facilitates quantitative determination by mass spec- 60 x is also a whole number that can range from 0 to about 100 

trometry of the relative amounts of proteins in different where the sum of n+xm+p+q is preferably less than about 

samples. Also, the use of differentially isotopically-labeled 100 and more preferably less than about 20; 

reagents as internal standards facilitates quantitative deter- R is an alkyl, alkenyl, alkynyl, alkoxy or aryl group; and 

mination of the absolute amounts of one or more proteins or R' is a hydrogen, an alkyl, alkenyl, alkynyl, alkoxy or aryl 

reaction products present in the sample. 65 group. 

In general, the affinity labeled protein reactive reagents of One or more of the CH 2 groups of the linker can be 

this invention have three portions: an affinity label (A) optionally substituted with small (C1-C6) alkyl, alkenyl, or 
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alkoxy groups, an aryl group or can be substituted with affinity tagged and in addition tagged peptides originating 

functional groups that promote ionization, such as acidic or from different samples are differentially iso topically labeled, 

basic groups or groups carrying permanent positive or As described above, affinity labeled peptides are isolated, 

negative charge. One or more single bonds connecting CH 2 released from the capture reagent and analyzed by (LC/MS). 

groups in the linker can be replaced with a double or a triple 5 Peptides characteristic of their protein origin are sequenced 

bond. Preferred R and R' alkyl, alkenyl, alkynyl or alkoxy using MS" techniques allowing identification of proteins in 

groups are small having 1 to about 6 carbon atoms. the samples. The relative amounts of a given protein in each 

One or more of the atoms in the linker can be substituted sample is determined by comparing relative abundance of 

with a stable isotope to generate one or more substantially the ions generated from any differentially labeled peptides 

chemically identical, but isotopically distinguishable 10 originating from that protein. The method can be used to 

reagents. For example, one or more hydrogens in the linker assess relative amounts of known proteins in different 

can be substituted with deuterium to generate isotopically samples. Further, since the method does not require any 

heavy reagents. prior knowledge of the type of proteins that may be present 

In an exemplary embodiment the linker contains groups in the samples, it can be used to identify proteins which are 

that can be cleaved to remove the affinity tag. If a cleavable 15 present at different levels in the samples examined. More 

linker group is employed, it is typically cleaved after affinity specifically, the method can be applied to screen for and 

tagged peptides, substrates or reaction products have been identify proteins which exhibit differential expression in 

isolated using the affinity label together with the CR. In this cells, tissue or biological fluids. It is also possible to deter- 

case, any isotopic labeling in the linker preferably remains mine the absolute amounts of specific proteins in a complex 

bound to the protein, peptide, substrate or reaction product. 20 mixture. In this case, a known amount of internal standard, 

Linker groups include among others: ethers, polyethers, one for each specific protein in the mixture to be quantified, 

ether diamines, polyether diamines, diamines, amides, is added to the sample to be analyzed. The internal standard 

polyamides, polythioethers, disulfides, silyl ethers, alkyl or is an affinity tagged peptide that is identical in chemical 

alkenyl chains (straight chain or branched and portions of structure to the affinity tagged peptide to be quantified 

which may be cyclic), aryl, diaryl or alkyl-aryl groups. Aryl 25 except that the internal standard is differentially isotopically 

groups in linkers can contain one or more hetero atoms (e.g., labeled, either in the peptide or in the affinity tag portion, to 

N, O or S atoms). distinguish it from the affinity tagged peptide to be quanti- 

In one aspect, the invention provides a mass spectrometric fied. The internal standard can be provided in the sample to 
method for identification and quantitation of one or more be analyzed in other ways. For example, a specific protein or 
proteins in a complex mixture which employs affinity 30 set of proteins can be chemically tagged with an 
labeled reagents in which the PRG is a group that selectively isotopically-labeled affinity tagging reagent. A known 
reacts with certain groups that are typically found in pep- amount of this material can be added to the sample to be 
tides (e.g., sulfhydryl, amino, carboxy, homoserine lactone analyzed. Alternatively, a specific protein or set of proteins 
groups). One or more affinity labeled reagents with different may be labeled with heavy atom isotopes and then deriva- 
PRG groups are introduced into a mixture containing pro- 35 tized with an affinity tagging reagent, 
teins and the reagents react with certain proteins to tag them Also, it is possible to quantify the levels of specific 
with the affinity label. It may be necessary to pretreat the proteins in multiple samples in a single analysis 
protein mixture to reduce disulfide bonds or otherwise (multiplexing). In this case, affinity tagging reagent s used to 
facilitate affinity labeling. After reaction with the affinity derivatize proteins present in different affinity tagged pep- 
labeled reagents, proteins in the complex mixture are 40 tides from different samples can be selectively quantified by 
cleaved, e.g., enzymatically, into a number of peptides. This mass spectrometry. 

digestion step may not be necessary, if the proteins are In this aspect of the invention, the method provides for 

relatively small. Peptides that remain tagged with the affinity quantitative measurement of specific proteins in biological 

label are isolated by an affinity isolation method, e.g., fluids, cells or tissues and can be applied to determine global 

affinity chromatography, via their selective binding to the 45 protein expression profiles in different cells and tissues. The 

CR. Isolated peptides are released from the CR by displace- same general strategy can be broadened to achieve the 

ment of A or cleavage of the linker, and released materials proteome-wide, qualitative and quantitative analysis of the 

are analyzed by liquid chromatography/mass spectrometry state of modification of proteins, by employing affinity 

(LC/MS). The sequence of one or more tagged peptides is reagents with differing specificity for reaction with proteins, 

then determined by MS" techniques. At least one peptide 50 The method and reagents of this invention can be used to 

sequence derived from a protein will be characteristic of that identify low abundance proteins in complex mixtures and 

protein and be indicative of its presence in the mixture. can be used to selectively analyze specific groups or classes 

Thus, the sequences of the peptides typically provide suffi- of proteins such as membrane or cell surface proteins, or 

cient information to identify one or more proteins present in proteins contained within organelles, sub-cellular fractions, 

a mixture. 55 or biochemical fractions such as immunoprecipitates. 

Quantitative relative amounts of proteins in one or more Further, these methods can be applied to analyze differences 

different samples containing protein mixtures (e.g., biologi- in expressed proteins in different cell states. For example, 

cal fluids, cell or tissue lysates, etc.) can be determined using the methods and reagents herein can be employed in diag- 

chemically identical, affinity tagged and differentially iso- nostic assays for the detection of the presence or the absence 

topically labeled reagents to affinity tag and differentially 60 of one or more proteins indicative of a disease state, such as 

isotopically label proteins in the different samples. In this cancer. 

method, each sample to be compared is treated with a In a second aspect, the invention provides a MS method 

different isotopically labeled reagent to tag certain proteins for detection of the presence or absence of a protein 

therein with the affinity label. The treated samples are then function, e.g., an enzyme activity, in a sample. The method 

combined, preferably in equal amounts, and the proteins in 65 can also be employed to detect a deficiency or excess (over 

the combined sample are enzymatically digested, if normal levels) of protein function in a sample. Samples that 

necessary, to generate peptides. Some of the peptides are can be analyzed include various biological fluids and 
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materials, including tissue and cells. In this case, the PRG of ADH1 gene expression for yeast grown on ethanol or sugar, 

the affinity labeled reagent is a substrate for the enzyme of FIG. 5C shows the mass spectrum for ADH2 yeast gene 

interest. Affinity labeled substrates are provided for each expression for yeast grown on either ethanol or galactose, 

enzyme of interest and are introduced into a sample where FIGS. 6A-6C represent the clinical analysis of patient 

they react to generate affinity labeled products, if the enzyme 5 samples for GM1 and SFB. See "Clinical Enzymology 

of interest is present in the sample. Products or unreacted Assays." 

substrate that are tagged with the affinity label are isolated FIG. 7 is a schematic representation of the automated 

by an affinity isolation method, e.g., affinity LC-MS/MS system. 

chromatography, via their selective binding to the CR. The FIG. 8 is a schematic representation of the SPE 6 -CE-MS/ 

isolated tagged substrates and products are analyzed by mass 10 MS system. 

spectrometry. Affinity labeled products include those in DETAILED DESCRIPTION OF THE 

which the substrate is entirely cleaved from the linker or in INVENTION 

which the substrate is modified by reaction with a protein of . . . . 

interest. Detection of the affinity-labeled product indicates The methods of thls invention emp oy affinity tagged 

the protein function is present in the sample. Detection of 15 P rote , ln , reactlv ^ ™8 ents m which the affinity tag is 

little or no affinity labeled product indicates deficiency or ^ova kn'ly attached to a protein reactive group by a linker, 

absence, respectively, of the protein function in the sample. Tne can he , ^topically labe ed to generate pairs or 

The amount of selected protein, e.g., measured in terms of » ts of reagents that are substantially chemically identical, 

enzyme activity, present in a sample can be measured by h f whlch are distinguishable by mass. For example a pair 

introducing a known amount of an internal standard which 20 °* rea S ents : one of whl f, n "» .^topically heavy and the other 

is an isotopically labeled analog of the expected product of of whlch 15 f°topically light can be emp oyed for the 

the enzymatic reaction of the reagent substrate. The internal comparison of two samples one of which may be a reference 

standard is substantially chemically identical to the expected sam P le coaming one or more known proteins in known 

enzymatic reaction product, but is isotopically distinguish- amounts - For exam P le ' anv one or more u of , ll f M^ger., 

able therefrom. The level of protein function (e.g., enzy- 25 nltr °S en ' °%& n . or sulfur atoms in the linker may be 

matic activity) in a given sample can be compared with replaced with their isotopically stable isotopes: H, C, N, 

activity levels in other samples or controls (either negative ' or 

or positive controls). The procedure therefore can detect the Suitable affinity tags bind selectively either covalently or 

presence, absence, deficiency or excess of a protein function non-covalently and with high affinity to a capture reagent 

in a sample. The method is capable of quantifying the 30 ( CR )- The CR " A interaction or bond should remain intact 

velocity of an enzymatic reaction since it enables the amount after extensive and multiple washings with a variety of 

of product formed over a known time period to be measured. solutions to remove non-specifically bound components. 

This method can be multiplexed, by simultaneous use of a The affinit y ta § bmds minimally or preferably not at all to 

plurality of affinity labeled substrates selective for different components in the assay system, except CR, and does not 

protein functions and if quantitation is desired by inclusion 35 significantly bind to surfaces of reaction vessels. Any non- 

of the corresponding internal standards for expected specific interaction of the affinity tag with other components 

products, to analyze for a plurality of protein functions in a or surfaces should be disrupted by multiple washes that 

single sample. leave CR-A intact. Further, it must be possible to disrupt the 

interaction of A and CR to release peptides, substrates or 

BRIEF DESCRIPTION OF THE FIGURES 40 reaction products, for example, by addition of a displacing 

ligand or by changing the temperature or solvent conditions. 

FIG. 1 is a standard curve generated with a cysteine- Preferably? neither CR nor A react chemically with other 

biotinylated peptide and quantitated by stable isotope dilu- comp onents in the assay system and both groups should be 

tion. A) Zoom-scan from an ion-trap mass spectrometer chemica ii y stab l e over the time period of an assay or 

showing a 4 amu isotope distribution for the [M2H] ions experiment The affinity tag pre ferably does not undergo 

of the peptide modified with the isotopically light (1457.9 u) pe ptide-like fragmentation during (MS)" analysis. The affin- 

and heavy (1461.8) biotinylating reagents. The ratio (d0/d8) ity kbd is preferably soluble in the sample liquid to be 

was 4.54. B) Curve generated from the analysis of isotope ana i yzed and the C R should remain soluble in the sample 

ratios from zoom-scans of 5 different concentrations of Hquid eyen though attached to an inso luble resin such as 

dO-labeled peptide measured in the presence of a known Agarose . In the case of CR the term so i ub i e means that CR 

amount of peptide labeled with the isotopically heavy fa sufficien tly hydrated or otherwise solvated such that it 

reagent. functions properly for binding to A. CR or CR-containing 

FIG. 2 is a tandem mass spectrum of a cysteine -modified conjugates should not be present in the sample to be 

peptide from a-lactalbumin. analyzed, except when added to capture A. 

FIG. 3 shows a mass spectrum for a single differentially 55 Examples of A and CR pairs include: 

labeled peptide pair. FIG. 3A shows four pairs of peptide d-biotin or structurally modified biotin-based reagents, 

ions characterized by the mass differential encoded in the including d-iminobiotin, which bind to proteins of avidin/ 

affinity tagged reagent. FIG. 3B shows an expanded view of streptavidin, which may, for example, be used in the forms 

the mass spectrum around one ion pair. FIG. 3C shows the 0 f strep avidin-Agarose, oligo me ric-avidin- Agarose, or 

reconstructed ion chromatograms for each peak of the pair 60 monomeric-avidin-Agarose; 

in FIG - 2 - any 1,2-diol, such as 1,2-dihydroxyethane (HO— CH 2 — 

FIG. 4A shows the CID spectrum for one of the peaks of CH 2 — OH), and other 1,2-dihydroxyalkanes including those 

one of the peptides analyzed in FIG. 3. FIG. 4B shows the 0 f cyclic alkanes, e.g., 1,2-dihydroxycyclohexane which 

result of database searching the CID spectrum of FIG. 4A. bm d to an alkyl or aryl boronic acid or boronic acid esters 

FIG. 5 A shows the method by which yeast group on 65 , such as pheriyl-B(0H) 2 or hexyl-B(0Ethyl) 2 which may be 

ethanol by converting ethanol into acetaldehyde which attached via the alkyl or aryl group to a solid support 

enters the TCA cycle. FIG. 5B shows the mass spectrum for material, such as Agarose; 
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maltose which binds to maltose binding protein (as well linker preferably contains groups or moieties that facilitate 
as any other sugar/sugar binding protein pair or more ionization of the affinity tagged reagents, peptides, sub- 
generally to any ligand/ligand binding protein pairs that has strates or reaction products. 

properties discussed above); To promote ionization, the linker may contain acidic or 
a hapten, such as dinitrophenyl group, for any antibody 5 basic groups, e.g., COOH, S0 3 H, primary, secondary or 
where the hapten binds to an anti-hapten antibody that tertiary amino groups, nitrogen-heterocycles, ethers, or corn- 
recognizes the hapten, for example the dinitrophenyl group binations of these groups. The linker may also contain 
will bind to an anti-dinitrophenyl-IgG; * § rou P s havin § a Permanent char § e > e -§;> phosphomum 

. . , , , groups, quaternary ammonium groups, sulfomum groups, 

a hgand which binds to a transition metal, for example, an chekted metal ions> tetralky Qr tetraryl borate Qr staWe 

ohgomenc histidine will bind to Ni(II), the transition metal IU car banions 

CR may be used in the form of a resin bound chelated The covalent bond of the Hnker tQ A Qr pRG shouM 

transition metal, such as mtnlotnacetic acid-chelated Ni(II) typically not be unintentionally cleaved by chemical or 

or iminodiacetic acid-chelated Ni(II); enzymatic reactions during the assay. In some cases it may 

glutathione which binds to glutathione -S -transferase. be desirable to cleave the linker from the affinity tag A or 

In general, any A-CR pair commonly used for affinity from the PRG, for example to facilitate release from an 

enrichment which meets the suitability criteria discussed affinity column. Thus, the linker can be cleavable, for 

above. Biotin and biotin-based affinity tags are preferred. Of example, by chemical, thermal or photochemical reaction, 

particular interest are structurally modified biotins, such as Photocleavable groups in the linker may include the l-(2- 

d-iminobiotin, which will elute from avidin or strepavidin nn mtrophenyl) -ethyl group. Thermally labile linkers may, for 

columns under solvent conditions compatible with ESI-MS 2 ° exam P le ' be a double -stranded duplex formed from two 

! • u -,m . + • • m o<w • complementary strands oi nucleic acid, a strand or a nucleic 

analysis, such as dilute acids containing 10-20% organic • -. i . . c . - i • • 1 

* . y . , j 1( , , . . , . , i j acid with a complementary strand or a peptide nucleic acid, 

solvent. It is expected that d-iminobiotin tagged compounds , , r . ' .-, i • -j * j u- u 

.„ 1 . r , , , tt „ t x • i • • 1 or two complementary peptide nucleic acid strands which 

will elute in solvents below pH 4. d-Iminobiotin tagged w{n dissociate heati cleavable linkers also include 

protein reactive reagents can be synthesized by methods 25 those having disulfide bonds? add Qr base kbfle groups? 

described herein for the corresponding biotin tagged including among others, diarylmethyl or trimethylarylm- 

reagents. em yl groups, silyl ethers, carbamates, oxyesters, thiesters, 

A displacement ligand, DL, is optionally used to displace thionoesters, and a-fluorinated amides and esters. Enzymati- 

A from CR. Suitable DLs are not typically present in cally cleavable linkers can contain, for example, protease- 

samples unless added. DL should be chemically and enzy- 30 sensitive amides or esters, p -lactam ase-sensitive p -lactam 

matically stable in the sample to be analyzed and should not analogs and linkers that are nuclease-cleavable, or 

react with or bind to components (other than CR) in samples glycosidase-cleavable. 

or bind non-specifically to reaction vessel walls. DL pref- The protein reactive group (PRG) can be a group that 

erably does not undergo peptide-like fragmentation during selectively reacts with certain protein functional groups or is 

MS analysis, and its presence in a sample should not 35 a substrate of an enzyme of interest. Any selectively reactive 

significantly suppress the ionization of tagged peptide, sub- protein reactive group should react with a functional group 

strate or reaction product conjugates. of interest that is present in at least a portion of the proteins 

DL itself preferably is minimally ionized during mass in a sample. Reaction of PRG with functional groups on the 

spectrometric analysis and the formation of ions composed P rotem should occur under conditions that do not lead to 

of DL clusters is preferably minimal. The selection of DL, 4 o substantial degradation of the compounds in the sample to 

depends upon the A and CR groups that are employed. In be analyzed. Examples of selectively reactive PRCs suitable 

general, DL is selected to displace A from CR in a reason- for use in the affinit y ta §§ ed reagents of this invention, 

able time scale, at most within a week of its addition, but include those whlch react Wlth sulfhydryl groups to tag 

more preferably within a few minutes or up to an hour. The proteins containing cysteine, those that react with amino 

affinity of DL for CR should be comparable or stronger than 4 5 § rou P s ' car boxylate groups, ester groups, phosphate reactive 

the affinity of the tagged compounds containing A for CR. § rou P s > and aldehyde and/or ketone reactive groups or, after 

Furthermore, DL should be soluble in the solvent used fragmentation with CNBr, with homoserine lactone, 

during the elution of tagged compounds containing A from Th iol reactive groups include epoxides, cx-haloacyl group, 

CR. DL preferably is free A or a derivative or structural nitriles, sulfonated alkyl or aryl thiols and maleimides. 

modification of A. Examples of DL include, d-biotin or 50 Amino reactive groups tag amino groups in proteins and 

d-biotin derivatives, particularly those containing groups include sulfonyl halides, isocyanates, isothiocyanantes, 

that suppress cluster formation or suppress ionization in MS. active esters, including tetrafluorophenyl esters, and 

The linker group (L) should be soluble in the sample N-hydroxysuccinimidyl esters, acid halides, and acid any- 
liquid to be analyzed and it should be stable with respect to hydrides. In addition, amino reactive groups include aide- 
chemical reaction, e.g., substantially chemically inert, with 55 h y des or ket °nes in the presence or absence of NaBH 4 or 
components of the sample as well as A and CR groups. The JNaciNli^. 

linker when bound to A should not interfere with the specific Carboxylic acid reactive groups include amines or alco- 

interactionofA with CR or interfere with the displacement hols in the presence of a coupling agent such as 

of A from CR by a displacing ligand or by a change in dicyclohexylcarbodiimide, or 2,3,5,6-tetrafluorophenyl trif- 

temperature or solvent. The linker should bind minimally or 60 luoroacetate and in the presence or absence of a coupling 

preferably not at all to other components in the system, to catalyst such as 4-dimethylaminopyridine; and transition 

reaction vessel surfaces or CR. Any non-specific interactions metal-diamine complexes including Cu(II)phenanthroline 

of the linker should be broken after multiple washes which Ester reactive groups include amines which, for example, 

leave the A-CR complex intact. Linkers preferably do not react with homoserine lactone. 

undergo peptide-like fragmentation during (MS)" analysis. 65 Phosphate reactive groups include chelated metal where 

At least some of the atoms in the linker groups should be the metal is, for example Fe(III) or Ga(III), chelated to, for 

readily replaceable with stable heavy-atom isotopes. The example, nitrilotriacetiac acid or iminodiacetic acid. 
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Aldehyde or ketone reactive groups include amine plus 
NaBH 4 or NaCNBH 3 , or these reagents after first treating a 
carbohydrate with periodate to generate an aldehyde or 
ketone. 

PRG groups can also be substrates for a selected enzyme 
of interest. The enzyme of interest may, for example, be one 
that is associated with a disease state or birth defect or one 
that is routinely assayed for medical purposes. Enzyme 
substrates of interest for use with the methods of this 
invention include, acid phosphatase, alkaline phosphatase, 
alanine aminotransferase, amylase, angiotensin converting 
enzyme, aspartate aminotransferase, creatine kinase, 
gamma-glutamyltransferase, lipase, lactate dehydrogenase, 
and glucose-6-phosphate dehydrogenase which are currently 
routinely assayed by other methods. 

Internal standards, which are appropriately isotopically 
labeled, may be employed in the methods of this invention 
to measure absolute quantitative amounts of proteins in 
samples. Internal standards are of particular use in assays 
intended to quantitate affinity tagged products of enzymatic 
reactions. In this application, the internal standard is chemi- 
cally identical to the tagged enzymatic product generated by 
the action of the enzyme on the affinity tagged enzyme 
substrate, but carries isotope labels which may include 2 H, 
13 C, 15 N, 17 0, 18 0, or 34 S, that allow it to be independently 
detected by MS techniques. Internal standards for use in the 
method herein to quantitate one or several proteins in a 
sample are prepared by reaction of affinity labeled protein 
reactive reagents with a known protein to generate the 
affinity tagged peptides generated from digestion of the 
tagged protein. Affinity tagged peptides internal standards 
are substantially chemically identical to the corresponding 
affinity tagged peptides generated from digestion of affinity 
tagged protein, except that they are differentially isotopi- 
cally labeled to allow their independent detection by MS 
techniques. 

The method of this invention can also be applied to 
determine the relative quantities of one or more proteins in 
two or more protein samples, the proteins in each sample are 
reacted with affinity tagging reagents which are substantially 
chemically identical but differentially isotopically labeled. 
The samples are combined and processed as one. The 
relative quantity of each tagged peptide which reflects the 
relative quantity of the protein from which the peptide 
originates is determined by the measurement of the respec- 
tive isotope peaks by mass spectrometry. 

The methods of this invention can be applied to the 
analysis or comparison of multiple different samples. 
Samples that can be analyzed by methods of this invention 
include cell homogenates; cell fractions; biological fluids 
including urine, blood, and cerebrospinal fluid; tissue homo- 
genates; tears; feces; saliva; lavage fluids such as lung or 
peritoneal ravages; mixtures of biological molecules includ- 
ing proteins, lipids, carbohydrates and nucleic acids gener- 
ated by partial or complete fractionation of cell or tissue 
homogenates. 

The methods of this invention employ MS and (MS)" 
methods. While a variety of MS and (MS)" are available and 
may be used in these methods, Matrix Assisted Laser 
Desorption Ionization MS (MALDI/MS) and Electrospray 
Ionization MS (ESI/MS) methods are preferred. 
Quantitative Proteome Analysis 

This method is schematically illustrated in Scheme 1 using 
a biotin labeled sulfhydryl-re active reagent for quantitative 
protein profile measurements in a sample protein mixture 
and a reference protein mixture. The method comprises the 
following steps: 
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Reduction. Disulfide bonds of proteins in the sample and 
reference mixtures are reduced to free SH groups. The 
preferred reducing agent is tri-n-butylphosphine which is 
used under standard conditions. Alternative reducing agents 

5 include mercaptoethylamine and dithiothreitol. If required, 
this reaction can be performed in the presence of solubiliz- 
ing agents including high concentrations of urea and deter- 
gents to maintain protein solubility. The reference and 
sample protein mixtures to be compared are processed 

10 separately, applying identical reaction conditions; 

Derivatization of SH groups with an affinity tag. Free SH 
groups are derivatized with the biotinylating reagent 
biotinyl-iodoacetylamidyl-4,7,10 trioxatridecanediamine 
the synthesis of which is described below. The reagent is 

15 prepared in different isotopically labeled forms by substitu- 
tion of linker atoms with stable isotopes and each sample is 
derivatized with a different isotopically labeled form of the 
reagent. Derivatization of SH groups is preferably per- 
formed under slightly basic conditions (pH 8.5) for 90 min 

20 at RT. For the quantitative, comparative analysis of two 
samples, one sample each (termed reference sample and 
sample) are derivatized as illustrated in Scheme 1 with the 
isotopically light and the isotopically heavy form of the 
reagent, respectively. For the comparative analysis of sev- 

25 eral samples one sample is designated a reference to which 
the other samples are related to. Typically, the reference 
sample is labeled with the isotopically heavy reagent and the 
experimental samples are labeled with the isotopically light 
form of the reagent, although this choice of reagents is 

30 arbitrary. These reactions are also compatible with the 
presence of high concentrations of solubilizing agents; 

Combination of labeled samples. After completion of the 
affinity tagging reaction defined aliquots of the samples 
labeled with the isotopically different reagents (e.g., heavy 

35 and light reagents) are combined and all the subsequent 
steps are performed on the pooled samples. Combination of 
the differentially labeled samples at this early stage of the 
procedure eliminates variability due to subsequent reactions 
and manipulations. Preferably equal amounts of each sample 

40 are combined; 

Removal of excess affinity tagged reagent. Excess reagent 
is adsorbed, for example, by adding an excess of 
SH-containing beads to the reaction mixture after protein SH 
groups are completely derivatized. Beads are added to the 

45 solution to achieve about a 5-fold molar excess of SH groups 
over the reagent added and incubated for 30 min at RT. After 
the reaction the beads are be removed by centrifugation; 

Protein digestion. The proteins in the sample mixture are 
digested, typically with trypsin. Alternative proteases are 

50 also compatible with the procedure as in fact are chemical 
fragmentation procedures. In cases in which the preceding 
steps were performed in the presence of high concentrations 
of denaturing solubilizing agents the sample mixture is 
diluted until the denaturant concentration is compatible with 

55 the activity of the proteases used. This step may be omitting 
in the analysis of small proteins; 

Affinity isolation of the affinity tagged peptides by inter- 
action with a capture reagent. The biotinylated peptides are 
isolated on avidin-agarose. After digestion the pH of the 

60 peptide samples is lowered to 6.5 and the biotinylated 
peptides are immobilized on beads coated with monomeric 
avidin (Pierce). The beads are extensively washed. The last 
washing solvent includes 10% methanol to remove residual 
SDS. Biotinylated peptides are eluted from avidin-agarose, 

65 for example, with 0.3% formic acid at pH 2; 

Analysis of the isolated, derivatized peptides by juLC- 
MS n or CE-MS" with data dependent fragmentation. Meth- 
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ods and instrument control protocols well-known in the art 
and described, for example, in Ducret et al., 1998; Figeys 
and Aebersold, 1998; Figeys et al., 1996; or Haynes et al., 
1998 are used. 

In this last step, both the quantity and sequence identity of 5 
the proteins from which the tagged peptides originated can 
be determined by automated multistage MS. This is 
achieved by the operation of the mass spectrometer in a dual 
mode in which it alternates in successive scans between 
measuring the relative quantities of peptides eluting from the 10 
capillary column and recording the sequence information of 
selected peptides. Peptides are quantified by measuring in 
the MS mode the relative signal intensities for pairs of 
peptide ions of identical sequence that are tagged with the 
isotopically light or heavy forms of the reagent, respectively, 15 
and which therefore differ in mass by the mass differential 
encoded within the affinity tagged reagent. Peptide sequence 
information is automatically generated by selecting peptide 
ions of a particular mass-to-charge (m/z) ratio for collision- 
induced dissociation (CID) in the mass spectrometer oper- 20 
ating in the MS" mode. (Link, A. J. et al., 1997; Gygi, S. P., 
et al. 1999; and Gygi, S. P. et al., 1999). The resulting CID 
spectra are then automatically correlated with sequence 
databases to identify the protein from which the sequenced 
peptide originated. Combination of the results generated by 25 
MS and MS" analyses of affinity tagged and differentially 
labeled peptide samples therefore determines the relative 
quantities as well as the sequence identities of the compo- 
nents of protein mixtures in a single, automated operation. 

Results of applying this method using the biotinylated 30 
sulfhydryl reagent and to the quantitative analysis of syn- 
thetic peptide samples, to the relative quantitation of the 
peptides in a protein digest and the tandem mass spectral 
analysis of a denvatized peptide are shown in FIG. 1, Table 
1, and FIG. 2, respectively. 35 

This method can also be practiced using other affinity tags 
and other protein reactive groups, including amino reactive 
groups, carboxyl reactive groups, or groups that react with 
homoserine lactones. 

The approach employed herein for quantitative proteome 40 
analysis is based on two principles. First, a short sequence 
of contiguous amino acids from a protein (5-25 residues) 
contains sufficient information to uniquely identify that 
protein. Protein identification by MS" is accomplished by 
correlating the sequence information contained in the CID 45 
mass spectrum with sequence databases, using sophisticated 
computer searching algorithms (Eng, J. et al., 1994; Mann, 
M. et al., 1994; Qin, J. et al, 1997; Clauser, K. R. et al., 
1995). Second, pairs of identical peptides tagged with the 
light and heavy affinity tagged reagents, respectively, (or in 50 
analysis of more than two samples, sets of identical tagged 
peptides in which each set member is differentially isotopi- 
cally labeled) are chemically identical and therefore serve as 
mutual internal standards for accurate quantitation. The MS 
measurement readily differentiates between peptides origi- 55 
nating from different samples, representing for example 
different cell states, because of the difference between 
isotopically distinct reagents attached to the peptides. The 
ratios between the intensities of the differing weight com- 
ponents of these pairs or sets of peaks provide an accurate 60 
measure of the relative abundance of the peptides (and hence 
the proteins) in the original cell pools because the MS 
intensity response to a given peptide is independent of the 
isotopic composition of the reagents (De Leenheer, A. P. et 
al (1992). The use of isotopically labeled internal standards 65 
is standard practice in quantitative mass spectrometry and 
has been exploited to great advantage in, for example, the 
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precise quantitation of drugs and metabolites in bodily fluids 
(De Leenheer, A. P. et al., 1992). 

In another illustration of the method, two mixtures con- 
sisting of the same six proteins at known, but different, 
concentrations were prepared and analyzed. The protein 
mixtures were labeled, combined and treated as schemati- 
cally illustrated in Scheme 1. The isolated, tagged peptides 
were quantified and sequenced in a single combined /iLC- 
MS and /iLC-MS" experiment on an ESI ion trap mass 
spectrometer. All six proteins were unambiguously identi- 
fied and accurately quantified (Table 2). Multiple tagged 
peptides were encountered for each protein. The differences 
between the observed and expected quantities for the six 
proteins ranged between 2 and 12%. 

The process is further illustrated for a single peptide pair 
in FIGS. 3A-C. A single scan of the mass spectrometer 
operated in MS mode is shown in FIG. 3A. Four pairs of 
peptide ions characterized by the mass differential encoded 
in the affinity tagged reagent are detected in this scan and 
indicated with their respective m/z values. The scan shown 
was acquired in 1.3 s. Over the course of the one-hour 
chromatographic elution gradient, more than 1200 such 
scans were automatically recorded. FIG. 3B shows an 
expanded view of the mass spectrum around the ion pair 
with m/z ratios of 993.8 and 977.7, respectively. Co-elution 
and a detected mass differential of four units potentially 
identifies the ions as a pair of doubly charged affinity tagged 
peptides of identical sequence (mass difference of eight and 
a charge state of two). FIG. 3C shows the reconstructed ion 
chromatograms for these two species. The relative quantities 
were determined by integrating the contour of the respective 
peaks. The ratio (light/heavy) was determined as 0.54 (Table 
1). The peaks in the reconstructed ion chromatograms 
appear serrated because in every second scan the mass 
spectrometer switched between the MS and the MS" modes 
to collect sequence information (CID mass spectrum) of a 
selected peptide ion. These CID spectra were used to iden- 
tify the protein from which the tagged peptides originated. 
FIG. 4A shows the CID spectrum recorded from the peptide 
ion with m/z=998 (marked with an arrow in FIG. 3A). 
Database searching with this CID spectrum identified the 
protein as glyceraldehyde-3-phosphate dehydrogenase (FIG. 
4B) which was a member of the protein mixture. 

Several beneficial features of the this method are appar- 
ent. First, at least two peptides were detected from each 
protein in the mixture. Therefore, both quantitation and 
protein identification can be redundant. Second, the identi- 
fied peptides all contained at least one tagged cysteinyl 
residue. The presence of the relatively rare cysteinyl residue 
in a peptide adds an additional powerful constraint for 
database searching (Sechi, S. et al., 1998). Third, tagging 
and selective enrichment of cysteine-containing peptides 
significantly reduced the complexity of the peptide mixture 
generated by the concurrent digestion of six proteins. For 
this protein mixture, the complexity was reduced from 293 
potential tryptic peptides to 44 tryptic peptides containing at 
least one cysteinyl residue. Fourth, the peptide samples 
eluted from the avidin affinity column are directly compat- 
ible with analysis by //LC-MS". 

Quantitative Analysis of Protein Expression in Different Cell 
States 

The protein reactive affinity reagent strategy was applied 
to study differences in steady-state protein expression in the 
yeast, S. cerevisiae, in two non-glucose repressed states 
(Table 3). Cells were harvested from yeast growing in 
log-phase utilizing either 2% galactose or 2% ethanol as the 
carbon source. One-hundred fig of soluble yeast protein 
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from each cell state were labeled independently with the cysteinyl residue. Thus, the complexity of the mixture is 

isotopically different affinity tagged reagents. The labeled reduced, while protein quantitation and identification are 

samples were combined and subjected to the strategy still achieved. The chemical reaction of the sulfhydryl 

described in Scheme f . One fiftieth (the equivalent of reagent with protein can be performed in the presence of 

approximately 2 jug of protein from each cell state) of the 5 urea, sodium dodecyl sulfate (SDS), salts and other chemi- 

sample was analyzed. cals that do not contain a reactive thiol group. Therefore, 

Glucose repression causes large numbers of proteins with proteins can be kept in solution with powerful stabilizing 
metabolic functions significant to growth on other carbon agents until they are enzymatically digested. The sensitivity 
sources to be minimally expressed (Ronne, H., f 995; of the /iLC-MS" system is dependent of the sample quality. 
Hodges, P. E. et al., 1 999). Growth on galactose or ethanol 10 In particular, commonly used protein solubilizing agents are 
with no glucose present results in the expression of glucose poorly compatible or incompatible with MS. Affinity puri- 
repressed genes. Table 3 presents a selection of 34 yeast fication of the tagged peptides completely eliminates con- 
genes encountered in the analysis, but it contains every taminants incompatible with MS. The quantitation and iden- 
known glucose-repressed genes that was identified (Mann, tification of low abundance proteins by conventional 
M. et al., 1994). Each of these genes would have been 15 methods requires large amounts (milligrams) of starting 
minimally expressed in yeast grown on glucose. Genes protein lysate and involves some type of enrichment for 
specific to both growth on galactose (GAL1, GAL10) as these low abundance proteins. Assays described above, start 
we 11 as growth on ethanol (ADH2,ACH1) were detected and with about 100 jug of protein and used no fractionation 
quantitated. techniques. Of this, approximately Vso of the protein was 

The quantitative nature of the method is apparent in the 20 analyzed in a single juLC-MS n experiment. This system has 
ability to accurately measure small changes in relative a limit of detection of 10-20 fmol per peptide (Gygi, S. P. 
protein levels. Evidence of the accuracy of the measure- et al. (1999)). For this reason, in the assays described which 
ments can be seen by the excellent agreement found by employ juLC-MS n only abundant proteins are detected, 
examining ratios for proteins for which multiple peptides However, the methods of this invention are compatible with 
were quantified. For example, the five peptides found from 25 any biochemical, immunological or cell biological fraction- 
PCK1 had a mean ratio ±95% confidence intervals of ation methods that reduce the mixture complexity and enrich 
1.57±0.15, and the percent error was <10%. In addition, the for proteins of low abundance while quantitation is main- 
observed changes fit the expected changes from the litera- tained. This method can be redundant in both quantitatian 
ture (Ronne, H., 1995; Hodges, P. E. et al., 1999). Finally, and identification if multiple cysteines are detected. There is 
the observed changes are in agreement with the changes in 30 a dynamic range associated with the ability of the method to 
staining intensity for these same proteins examined after quantitate differences in expression levels of affinity tagged 
two-dimensional gel electrophoresis (data not shown). peptides which is dependent on both the intensity of the 

The alcohol dehydrogenase family of isozymes in yeast peaks corresponding the peptide pair (or set) and the overall 

facilitates growth on either hexose sugars (ADH1) and mixture complexity. In addition, this dynamic range will be 

ethanol (ADH2). The gene ADH2 encodes an enzyme that is 35 different for each type of mass spectrometer used. The ion 

both glucose- and galactose-repressed and permits a yeast trap was employed in assays described herein because of its 

cell to grow entirely on ethanol by converting it into ability to collect impressive amounts of sequencing infor- 

acetaldehyde which enters the TCA cycle (FIG. 5A). In the mation (thousands of proteins can potentially be identified) 

presence of sugar, ADH1 performs the reverse reaction in a data-dependent fashion even though it offers a more 

converting acetaldehyde into ethanol. The regulation of 40 limited dynamic quantitation range. The dynamic range of 

these isozymes is key to carbon utilization in yeast (Ronne, the ion trap (based on signal-to-noise ratios) varied depend- 

H., 1995). The ability to accurately measure differences in ing on the signal intensity of the peptide pair and complexity 

gene expression across families of isozymes is sometimes of the mixture, but differences of up to 100-fold were 

difficult using cDNA array techniques because of cross generally detectable and even larger differences could be 

hybridization (DeRisi, J. L. et al., 1997). The method of this 45 determined for more abundant peptides. In addition, protein 

invention applied as illustrated in FIG. 1 succeeded in expression level changes of more than 100-200-fold still 

measuring gene expression for each isozyme even though identify those proteins as major potential contributors to the 

ADH1 and ADH2 share 93% amino acid (88% nucleotide) phenotypic differences between the two original cell states, 

sequence similarity. This was because the affinity tagged The method can be extended to include reactivity toward 

peptides from each isozyme differed by a single amino acid 50 other functional groups. A small percentage of proteins (8% 

residue (valine to threonine) which shifted the retention time for S. cerevisiae) contain no cysteinyl residues and are 

by more than 2 min and the mass by 2 daltons for the ADH2 therefore missed by analysis using reagents with sulfhydryl 

peptides (FIG. 5B). ADH1 was expressed at approximately group specificity (i.e., thiol group specificity). Affinity 

2-fold high levels when galactose was the carbon source tagged reagents with specificities toward functional groups 

compared with ethanol. Ethanol-induction of ADH2 expres- 55 other than sulfhydryl groups will also make cysteine -free 

sion resulted in more than 200-fold increases compared with proteins susceptible to analysis. 

galactose-induction. The methods of this invention can be applied to analysis 

The results described above illustrate that the method of of low abundance proteins and classes of proteins with 

this invention provides quantitative analysis of protein mix- particular physico-chemical properties including poor 

tures and the identification of the protein components therein 60 solubility, large or small size and extreme p/values. 

in a single, automated operation. The prototypical application of the chemistry and method 

The method as applied using a sulfhydryl reactive reagent is the establishment of quantitative profiles of complex 

significantly reduces the complexity of the peptide mixtures protein samples and ultimately total lysates of cells and 

because affinity tagged cysteine -containing peptides are tissues following the preferred method described above. In 

selectively isolated. For example, a theoretical tryptic digest 65 addition the reagents and methods of this invention have 

of the entire yeast proteome (6113 proteins) produces 344, applications which go beyond the determination of protein 

855 peptides, but only 30,619 of these peptides contain a expression profiles. Such applications include the following: 
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Application of amino-reactive or sulfhydryl-re active, dif- 
ferentially isotopically labeled affinity tagged reagents for 
the quantitative analysis of proteins in immuno precipitated 
complexes. In the preferred version of this technique protein 
complexes from cells representing different states (e.g., 5 
different states of activation, different disease states, differ- 
ent states of differentiation) are precipitated with a specific 
reagent, preferably an antibody. The proteins in the precipi- 
tated complex are then derivatized and analyzed as above. 

Application of amino-reactive, differentially isotopically 10 
labeled affinity tagged reagents to determine the sites of 
induced protein phosphorylation. In a preferred version of 
this method purified proteins (e.g., immunoprecipitated from 
cells under different stimulatory conditions) are fragmented 



cal instruments such as mass spectrometers. The application 
of the methods of the present invention to the analysis of 
membrane proteins is exemplified using human T cell lym- 
phoma cell line Jurkat for membrane protein labeling and 
extraction and the well characterized human prostate epi- 
thelial cell line P69SV40T and two P69SV40T sublines 
which differ in IGF-1 receptor expression by factor of 10 to 
exemplify quantitative, differential analysis of membrane 
proteins. 

Jurkat cells are an appropriate model system because the 
cells are easy to grow in large numbers and because the 
modulation of cell surface proteins in response to different 
stimuli and experimental conditions has been well charac- 
terized in T lymphocytes. Commercially available biotiny- 



and derivatized as described above. Phosphopeptides are 15 lating reagents or more generally affinity tagging reagents 



identified in the resulting peptide mixture by fragmentation 
in the ion source of the ESI-MS instrument and their relative 
abundances are determined by comparing the ion signal 
intensities of the experimental sample with the intensity of 
an included, isotopically labeled standard. 

Amino-reactive, differentially isotopically labeled affinity 
tagged reagents are used to identify the N-terminal ion series 
in MS" spectra. In a preferred version of this application, the 
peptides to be analyzed are derivatized with a 50:50 mixture 



are employed to derivatize lysine residues and the free 
N-termini. Water soluble biotinylating reagents such as 
Sulfo-NHS (N-hydroxy succinimide) biotin and analogs 
(Sulfosuccinimidyl-6-(biotinamido)-hexanoate, Pierce, 
20 Rockford, 111.) which have been used extensively for label- 
ing cell surface proteins can be employed. The reaction of 
NHS esters with primary amines is best at neutral pH values 
and above and is compatible with the presence of organic 
solvent such as DMSO or DMF. Biotinylation of cell surface 



of an isotopically light and heavy reagent which is specific 25 proteins from the Jurkat cells is carried out in PBS buffer at 



for amino groups. Fragmentation of the peptides by CID 
therefore produces two N-terminal ion series which differ in 
mass precisely by the mass differential of the reagent species 
used. This application dramatically reduces the difficulty in 



pH 7.2. Cells (lxlO 7 ) are washed with PBS buffer to remove 
contaminating serum and other proteins from the culture 
medium. The cells are resuspended at 25xl0 6 cell/ml and 
reacted with 0.5 mg/ml of Sulfo-NHS-Biotin (Pierce, 



determining the amino acid sequence of the derivatized 30 Rockford, 111.) for 30 min at RT. The labeled cells are 



peptide. 

Quantitative Analysis of Surface Proteins in Cells and Tissue 
The cell exterior membrane and its associated proteins 
(cell surface proteins) participate in sensing external signals 



washed twice with cold PBS to remove unreacted biotiny- 
lating reagent. Biotinylated cells are solubilized at 5xl0 7 
cells/ml in lysis buffer containing 1% Triton X-114. Triton 
X-114 has the property of phase-partitioning into detergent 



and responding to environmental cues. Changes in the 35 phase and aqueous phase at 30° C. Following the phase 



abundance of cell surface proteins can reflect a specific 
cellular state or the ability of a cell to respond to its changing 
environment. Thus, the comprehensive, quantitative charac- 
terization of the protein components of the cell surface can 



partitioning, detergent phase is removed from the aqueous 
phase by centrifugation at 300xg. Phase partitioning has 
previously been successfully used to enrich cell membrane. 
Also, this technique was found to enrich membrane proteins 



identify marker proteins or constellations of marker proteins 40 from Jurkat cell lysates. Triton phase is diluted 1:5 (v/v) 



characteristic for a particular cellular state, or explain the 
molecular basis for cellular responses to external stimuli. 
Indeed, changes in expression of a number of cell surface 
receptors such as Her2/neu, erbB, IGF1 receptor, and EGF 



using 50 mM ammonium bicarbonate buffer, pH 8.5, and 
high-purity, modified porcine-trypsin is added to digest the 
proteins at a concentration of 12.5 ng/ml for overnight at 37° 
C. Trypsin is neutralized by the addition of a cocktail of 



receptor have been implicated in carcinogenesis and a 45 serine protease inhibitors and tryptic peptides are isolated by 



current immunological therapeutic approach for breast can- 
cer is based on the infusion of an antibody (Herceptin, 
Genentech, Palo Alto, Calif.) that specifically recognizes 
Her2/neu receptor. 

Cell surface proteins are also experimentally accessible. 50 
Diagnostic assays for cell classification and preparative 
isolation of specific cells by methods such as cell sorting or 
panning are based on cell surface proteins. Thus, differential 
analysis of cell surface proteins between normal and dis- 



the avidin affinity chromatography techniques. Eluted pep- 
tides are separated e.g., by juLC methods and identified by 
searching peptide sequence databases, using for example, 
the Sequest program. 

The human prostate epithelial cell line P69SV40T which 
was immortalized with SV 40 T antigen has been well 
characterized. This cell line is immortal but not tumorigenic 
and expresses type 1 insulin like growth factor receptor 
(IGF-1R) at 2xl0 4 receptors per cell. A subline, called M12, 



eased (e.g., cancer) cells can identify important diagnostic or 55 was derived from P69SV40T by sequential passage in male 



therapeutic targets. While the importance of cell surface 
proteins for diagnosis and therapy of cancer has been 
recognized, membrane proteins have been difficult to ana- 
lyze. Due to their generally poor solubility they tend to be 
under-represented in standard 2D gel electrophoresis pat- 
terns and attempts to adapt 2D electrophoresis conditions to 
the separation of membrane proteins have met limited 
success. The method of this invention can overcome the 
limitations inherent in the traditional techniques. 

The analysis of membrane proteins is challenging because 65 
they generally are difficult to maintain in solution under 
conditions that are compatible with high sensitivity analyti- 



athymic nude mice. This cell line is highly tumorigenic and 
metastatic and expresses 1.1 xlO 3 IGF-1R per cell. The 
relative difference in the abundance of IGF-1R in the cell 
lines P69SV40T and Ml 2 can be quantitatively determined 
60 using methods of this invention adapted for application to 
membrane proteins. Since the number of IGF-1R for these 
cell lines has already been determined, this well character- 
ized system can provide a reference to validate the efficiency 
of the quantitative methods of this invention. 

P69SV40T cells (lxlO 7 ) are biotinylated with an isoto- 
pically heavy biotin tagged amino reactive reagent and the 
M12 cells (lxlO 7 ) are biotinylated with a corresponding 
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isotopically light amine reactive biotin tagged amino reac- 
tive reagent. IGF-1R is then immunoprecipitated from the 
combined lysate of both cell lines using an antibody against 
human IGF-1R and the total mass of immunoprecipitated 
proteins is digested with trypsin. Trypsin is then neutralized, 
e.g., by the addition of inhibitors and tagged peptides are 
purified by biotin-avidin affinity chromatography. The 
eluted peptides are analyzed by LC-MS and LC-MS" for 
peptide quantitation and identification, respectively, as has 
been described above. Quantitation in this experiment is 
facilitated by the option to use selective ion monitoring in 
the MS. In this mode only the masses of tagged peptide ions 
expected to derive from IGF-1R need be monitored. 

The described technique can be applied to compare the 
differences in relative abundance of cell surface proteins 
between parental prostate cell line (P69SV40T) and M12 
cells to detect and identify those cell surface proteins whose 
expression level is different in the two cell lines and which 
may be characteristic of the different cell states. Using the 
methods described herein, global, relative quantitation of the 
cell surface proteins in any two or more cell lines can be 
analyzed to detect and identify those cell surface proteins 
characteristic of the different cell states. Results can be 
independently confirmed using procedure such as ID or 2D 
gels, if applicable, or quantitative western blotting to con- 
firm quantitation results. 

It is expected that the experimental variability of quanti- 
tation of cell surface proteins will be considerably better 
than the accuracy of quantitation achieved by currently 
available cDNA array technology. In addition to relative 
protein quantity and identity, the method can also be used to 
reveal the orientation of the protein in the membrane, based 
on the presumption that intact, alive cells will exclude the 
biotinylating reagent. 

Alternative methods can be applied to enhance the selec- 
tivity for tagged peptides derived from cell surface proteins. 
For example, tagged cell surface proteins can be trypsinized 
directly on the intact cells to generate tagged peptides, 
purified and analyzed as discussed. In addition, traditional 
cell membrane preparations may be used as an initial step to 
enrich cell surface proteins. These methods can include 
gentle cell lysis with a dounce homogenizer and series of 
density gradient centrifugations to isolate membrane pro- 
teins prior to proteolysis. This method can provide highly 
enriched preparations of cell surface proteins. Affinity 
tagged proteins may also be isolated by affinity chromatog- 
raphy prior to proteolysis as well as after proteolysis. This 
chromatography can be performed in the presence of sur- 
factants such as TX-100, NP-40 or Tween-20 to maintain 
protein solubility. The sequential application of affinity 
chromatography steps (one for the intact protein and one for 
the tagged peptide fragments) provides a high degree of 
selectivity. These alternative methods are easily scalable for 
the detection of low abundance membrane proteins and the 
relative quantity of tagged peptides tagged is maintained 
through the selective enrichment steps. 

In the application of the methods of this invention to cell 
surface proteins, once the tagged proteins are fragmented, 
the tagged peptides behave no differently from the peptides 
generated from more soluble samples. 
Synthesis of Affinity Tagged Protein Reactive Reagents that 
are Selective for Certain Proteins Groups 

Synthetic routes of exemplary affinity tagged reagents 
suitable for use in the methods of this invention are provided 
in Schemes 2-3 where well-known synthetic techniques are 
employed in synthesis of the non-deuterated and deuterated 
reagents. 
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Biotinyl-iodoacetylamidyl-4,7,10 trioxatndecanediamine 
4 (Scheme 3) consists of a biotin group, a chemically inert 
spacer capable of being isotopically labeled with stable 
isotopes and an odoacetamidyl group, respectively. The 
biotin group is used for affinity enrichment of peptides 
derivatized with the reagent, the ethylene glycol linker is 
differentially isotopically labeled for mass spectral analysis 
and the iodoacetamidyl group provides specificity of the 
reagent for sulfhydryl-containing peptides. The reagent can 
be synthesized in an all hydrogen form (isotopically light 
form) with 1-20, and preferably 4-8 deuterium atoms in the 
linker (isotopically heavy forms). 

Analysis of Velocities of Multiple Enzymes in Cell Lysates 

Monitoring enzyme functions by biochemical assays is an 
essential diagnostic tool that employs a multitude of ana- 
lytical techniques including spectrophotometric, 
fluorometric, and radiometric detection of products. 
However, current methods are difficult to use for assaying 
several enzymes simultaneously in a single sample. Mass 
spectrometry for quantification of a collection of metabolites 
in biological fluids has emerged as a powerful approach for 
the analysis of birth defects (Morris et al., 1994), but this 
analytical technique has not been developed for the direct 
analysis of rates of individual enzymatic steps. The analyti- 
cal method described herein for monitoring and quantifica- 
tion of enzymatic activities in cell homogenates and other 
biological samples permits simultaneous (multiplex) moni- 
toring of multiple reactions, and can be readily automated. 

A feature of the method of this invention as applied to 
enzyme assays is the use of electrospray ionization mass 
spectrometry (ESI-MS) (Cole et at., 1997) for the simulta- 
neous detection of enzymatic products and chemically iden- 
tical internal standards, which are distinguished by stable 
isotope (deuterium) labeling. A second feature is the use of 
affinity tagged reagents containing an enzyme substrate 
which when combined with affinity purification provides for 
facile capture of enzymatic products from crude biological 
fluids. The affinity tagged reagents are designed to contain a 
target substrate for an enzyme of interest that is covalently 
attached to an affinity tag via a linker. Action of the enzyme 
of interest on the substrate conjugate causes cleavage or 
other modification that changes its molecular mass (Scheme 
4). The change of mass is detected by ESI-MS. The linker 
and affinity tag used preferably facilitate ionization by ESI, 
block action of other enzymes in the biological fluid, and 
allow highly selective capture from the complex matrix for 
facile purification. 

An example of this approach is the design and synthesis 
of affinity tagged enzyme substrate reagents 1 and 2 
(Scheme 5) to simultaneously assay lysosomal 
p-galactosidase and N-acetyl-a-D-glucosaminidase, respec- 
tively. Deficiency of the former enzyme results in one of the 
lysosomal storage diseases, GM a -gangliosidosis, a condition 
that occurs in the population with a frequency of about 1 in 
50,000 and leads to early death of affected children. Defi- 
ciency of N-acetyl-a-D-glucosaminidase results in the rare 
lysosomal storage disorder Sanfilippo syndrome type B. 
This example has been described in Gerber et al. (1999) J. 
Amer. Chem. Soc. 121: 1102-1103 which is incorporated by 
reference herein in its entirety. 

Conjugates 1 and 2 consist of biotin as an affinity tag, 
which is coupled to sar cosine. Biotin allows highly specific 
capture of the substrate conjugate through non-covalent 
binding to streptavidin immobilized on agarose beads 
(Bayer et al., 1990). Sarcosine provides an N-methylated 
amide linkage to biotin to block the enzyme biotinidase, 
which is often present in the cellular fluids and could cause 
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cleavage of the conjugate molecule during the assay (Wilbur reagents 1 and 2 (m/z 843 and 840), internal standards 5 and 
et al., 1997). In addition, it was found that bio tinyl-sar cosine 6 (m/z 689 and 641), and trace amounts of products 3 and 
conjugates can be displaced from streptavidin by addition of 4 (m/z 681 and 637). Ions due to clusters of biotin also 
biotin. The N-biotinylsarcosine block is linked to a polyether appear in the spectrum but did not interfere with the analy- 
diamine, the length of which can be varied to avoid mass/ 5 sis. The presence of nondeuterated products in the blank 
charge overlaps of products and internal standards. The may be due to nonenzymatic substrate reagent hydrolysis 
linker also allows facile introduction of multiple deuterium during sample work up or to collision-induced dissociation 
atoms (i.e., 8 deuteriums in 5 and 4 in 6, Scheme 5) to permit of the substrate ion in the gas phase. A MS/MS spectrum of 
the synthesis of internal standards. The d8 -linker was made the (conjugate 1+H) + ion at m/z 843 gave a prominent 
by reacting D0CH 2 CH 2 0CH 2 CH 2 0D with CD 2 =CDCN in fragment of (conjugate 3+H) + at m/z 681 (spectrum not 
benzene with catalytic NaOD (Ashikaga, K., et al., 1988) shown). The ESI-MS spectrum of a sample incubated with 
and the resulting dinitrile was reduced to the diamine with cell homogenate from a healthy individual clearly shows the 
Ra — Ni. The d4-linker was made in the same way using p-galactosidase product at m/z 681 and the N-acetyl-a-D- 
ethylene glycol and CD2=dCDCN in CH 3 CN and catalytic glucosaminidase product at m/z 637 (FIG. 6B). Triplicate 
NaOH. enzymatic reactions using cells from a healthy patient 
In addition, the linker is hydrophiiic to ensure good water 15 yielded a p-galactosidase specific activity of 51 ±3 nmol/h/ 
solubility of the substrate conjugate, and it has basic groups (mg cell protein) and an N-acetyl-a-D-glucosaminidase 
which are efficiently protonated by ESI and thus ensure specific activity of 1.4±0.3 nmol/h/mg. Time course studies 
sensitive detection by mass spectrometry. The target carbo- confirmed that the initial reaction velocities were being 
hydrate substrates are attached to the polyether linker by a measured. Values obtained with cells from six healthy 
p -alanine unit (Scheme 5). The enzymatic product conju- 20 individuals ranged from 3 6 ±4 to 68 ±3 nmol/h/mg for 
gates 3 and 4 are also shown Scheme 5. Conjugates 1 and 2 p-galactosidase and 0.9 ±0.05 to 2.3±0.4 nmol/h/mg for 
were prepared as shown in Scheme 5. All reagents were N-acetyl-a-D-glucosaminidase. In contrast, very little enzy- 
purified to homogeneity by reverse-phase HPLC and char- matic product above the blank level (0.9±0.9 and 0.8±0.6 
acterized by high-field 1H-NMR and ESI-MS. The substrate nmol/h/mg) was observed when cells from two patients with 
was linked to the diamine spacer by Michael addition of the 25 p-galactosidase deficiency were used, whereas N-acetyl-a- 
latter onto the p-acryloylamidophenyl glycoside, D -glucosaminidase activity is clearly detected (FIG. 6C). 
(Romanowska et al., 1994) and the intermediate was These spectra were obtained with 0.75 jug of cell protein, 
coupled with the tetrafluorophenyl ester of corresponding to -1000 fibroblasts. Thus the ESI-MS 
N-biotinylsarcosine (Wilbur et al., 1997). method has very high sensitivity for biomedical applica- 
ble ESI-MS assay of p-galactosidase and N-acetyl-R-D- 30 tions. 
glucosaminidase is based on enzymatic cleavage of the ESI-MS was carried out on a Finnigan LCQ ion trap 
glycosidic bond to release monosaccharide and conjugates 3 instrument. Data were collected in full scan mode from m/z 
and 4 (mass differences are 162 and 203 Da, respectively). 625 to 875 by direct infusion at 1.5 //L/min. Specific 
In a typical procedure, 0.2 mM 1 and 0.3 mM 2 were activities were obtained from the ratio of product to internal 
incubated with sonicated cultured fibroblasts from indi- 35 standard ion peak areas (averaged over 30 scans), 
vidual patients with p-galactosidase deficiency and with The approach described for assaying enzymes using sub- 
fibroblasts cultured from unaffected people. After strate reagents and ESI-MS can be broadly applied. The 
incubation, labeled internal standards 5 and 6 were added, multiplex technique can be expanded to assay dozens or 
and the biotinylated components were captured on more enzymes simultaneously in a single reaction, obviating 
streptavidin-agarose beads. Quantitative strepavidin capture 40 the need for multiple assays to assist in confirming diag- 
efficiency from a cell homogenate was observed with model noses of rare disorders. The method can be used to measure 
reagents. After purification by multiple washings to remove several enzymes simultaneously when evaluating the rate of 
nonspecifically bound components, the biotinylated prod- chemical flux through a specific biochemical pathway or for 
ucts were released by free biotin, and the eluant was monitoring biochemical signaling pathways. The affinity 
analyzed by ESI-MS. About 85% release of the biotinylated 45 tag-capture reagent method for isolation of affinity tagged 
products was observed after incubation with excess biotin reaction products and substrates from complex mixtures is 
for 90 min. A blank was obtained by quenching the assay technically simple and can be readily automated, particular 
with all components present at time zero. when biotin-strepavidin is employed. Because of the high 
Atypical procedure, cell protein (75 jug) in 15 juL of water sensitivity of the ESI-MS detection employed, which 
was added to 15 juh buffer (0.1 M Na citrate, pH 4.25) 50 requires only sub-microgram quantities of the substrate 
containing 2 (0.3 mM) and 1 (0.2 mM, added 5 h after reagents per assay, the synthesis of several hundred substrate 
addition of cell protein). After incubation for 5.5 h at 37° C, reagents on a low-gram scale becomes practical and eco- 
the reaction was quenched by addition of 200 juL of 0.2 M nomical. Since most enzyme active sites are exposed to 
glycine carbonate buffer, pH 10.3, and 5 and 6 (1 nmol each) solvent, it is possible to attach an affinity tagged linker to 
were added. After centrifugation to remove cell debris, the 55 most enzyme substrates while preserving enzymatic activity, 
supernatant was loaded onto a bed of streptavidin-agarose (7 Scheme 6 provides the structures of several additional 
nmol biotin binding capacity, Pierce) in a small filtration enzyme substrates, suitable for use in this method, indicating 
device (micro BioSpin, Bio-Rad). After 5 min, filtration was by arrows allowable positions for tag attachment sites, 
effected by centrifugation, and the gel bed was washed with Allowable tag sites for additional enzyme substrates can be 
0.1% Triton X-100 (about 1 min incubation, then spin) and 60 determined by review of X-ray crystal structures of enzyme- 
then six times with purified water (Milli-Q, Millipore). substrate or enzyme -substrate analog structures. Using a 
Elution was carried out in 25 juL of 50% methanol contain- standard computer graphics program, available X-ray data 
ing 56 nmol of free biotin (1-10 h incubation, then spin). and by attaching an extended chain butyl group (as a model 
Filtrate was diluted 4-fold with 50% methanol/water, and 1 for the affinity tagged linker) to potential tag attachment 
juL was analyzed by ESI-MS. 65 sites, suitable attachment sites that show there are no 
The ESI-MS spectrum of the blank (FIG. 6A) is remark- enzyme-atoms in van der Wasls overlap with the model tag 
ably simple, showing peaks of the (M+H) + ions from can be predicted. 
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Analogous methods to those described above can be 
applied to the analysis of enzymes associated with other 
Sanfillipo Syndromes (A, C and D). SFA is associated with 
heparin sulfamidase, SFC is associated with acetyl-CoA- 
alpha-glucosaminide N-acetyltransferase and SFD is asso- 
ciated with N-acetylglucosamine 6-sulfatase. Exemplary 
affinity tagged enzyme substrate reagents useful in the 
analysis of these enzymes and the diagnosis of these disor- 
ders are provided below. The methods can also be applied of 
the diagnosis of Niemann-Pick Type A and B disease by 
assaying for acid sphingomyelinase and to the diagnosis of 
Krabbe disease by assaying for galactocerebroside beta- 
galacatosidase. These enzymes are currently assayed 
employing fluorophore-derivatized reagents as indicated in 
Scheme 7. Enzyme substrate reagents for assay of these 
enzymes in the methods herein can be readily prepared by 
replacement of the fluorophore with an A-L group herein. 
This approach to preparation of affinity tagged enzyme 
substrates is generally applicable to any known fluorophore- 
derivatized enzyme substrate or substrate analog. 

Table 4 provides exemplary enzymes that are associated 
with certain birth defects or disease states. These enzymes 
can be assayed by the methods described herein. 
Assaying Enzymatic Pathways for Carbohydrate -Deficient 
Glycoprotein Syndromes (CDGS) 

The methods and reagents of this invention can be 
employed to quantify the velocities of multiple enzymes 
pertinent to diagnosis of CDGS diseases. 

CDGS Type la and lb are caused by the deficiency or 
absence of the enzymes phosphomannoisomerase (PMIb) 
(Type lb) and phosphomannomutase (PMM2) (Type la) 
which are part of a multistep pathway (Scheme 8) for 
conversion of glucose to mannose-1 -phosphate (Freeze, 
1998). The monosaccharide substrates involved in the path- 
way are fructose- 6-phosphate, mannose-6-phosphate, and 
mannose-l-phosphate. These monosaccharides can be 
somewhat difficult to convert to substrate conjugates 
because it is not a priori clear which atom on the sugar 
should be conjugated with the linker without impairing 
enzyme activity. PMIb and PMM2 can, however, be assayed 
indirectly. Mammalian cell microsomes contain dolichol-P- 
mannose synthase which catalyzes the reaction of dolichol- 
phosphate with GDP-mannose to form dolichol-P-mannose 
and GDP (Scheme 8, Chapman et al. 1980). This synthase 
can be assayed using the methods of this invention, specifi- 
cally with a biotin-linker substrate. Microbial PMM and the 
enzyme which makes GDP-mannose from GTP and 
mannose-l-P, GDP-mannose pyrophosphorylase, are readily 
purified from bacteria and yeast (Glaser, 1966, Preiss, 1966), 
and these enzymes can be supplied exogenously to the 
enzyme assay. If PMIb activity is deficient, little or no 
mannose-6-P will be made when the reaction sequence is 
started by addition of fructose-6-P. Without mannose-6-P, 
mannose-l-P and GDP-mannose will not be made, and thus 
no conjugated-dolichol-P-mannose will be detected by ESI- 
MS. Exogenous GTP is supplied as a requirement for the 
GDP-mannose pyrophosphorylase step, and ATP is omitted 
so that mannose-6-P cannot be made from mannose. To 
assay PMM2, the reaction sequence is initiated with 
mannose -6-P, and PMM2 deficiency results in the failure to 
make conjugated-dolichol-P-mannose . 

The carrier dolichol is a -60- to 105 -carbon isoprenoid. 
Evidence is accumulating that many enzymes that operate 
on carbohydrates attached to dolichol chains are tolerant to 
significant shortening of the dolichol chain; even 10- and 
15-carbon dolichols are tolerated (Rush and Wachter, 1995). 
It appears that such enzymes act on the water-soluble 
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carbohydrate portion of the dolichol conjugate and thus have 
little or no requirement to bind the dolichol anchor. Based on 
this, an affinity labeled substrate for the direct assay of 
dolichol-P-mannose synthase and the indirect assay of PMIb 

5 and PMM2 is prepared by attaching an affinity labeled linker 
to the non-polar end of a short dolichol, such as the 
10-carbon dolichol analog citronellol. 

The synthesis of a biotinylated dolichol 10 -substrate con- 
jugate containing a sarcosinyl linker (B — S-Do^O — P) is 

10 shown in Scheme 9. Protected citronellol (R=t-BuSiMe 2 ) is 
regioselectively oxidized at the terminal alyllic methyl 
group (McMurry and Kocovsky, 1984), and the allylic 
alcohol is coupled with biotinylsarcosine active ester 
(R=CH3). The citronellol 1-hydroxy group is subsequently 

15 deprotected and phosphorylated with POCl 3 (Rush and 
Wachter, 1995). In a parallel synthesis, d 5 -sarcosine, 
CD 3 NHCD 2 COOH, is used to prepare the isotopically 
labeled (heavy) reagent for use as an internal standard. 
d 5 -Sarcosine is readily prepared from commercially avail- 

20 able materials (BrCD 2 C00D and CD 3 NH 2 ) using standard 
synthetic techniques. 

The deuterated internal standard, B-d 5 -S-Dol 10 -P- 
Mannose, is synthesized enzymatically by incubating hen 
oviduct microsomes with GDP-mannose and the synthetic 

25 B-d 5 -S-Dol 10 -P substrate conjugate (Rush and Waechler, 
1995). An added advantage of the B — S -conjugate is that it 
allows for a facile affinity purification of the microsomal 
mannosylated product by specific capture on agarose- 
streptavidin beads followed by elution with free biotin. 

30 This method employing affinity tagged short dolichol 
analogues is generally applicable for assaying other 
enzymes that operated on dolichol anchored carbohydrates. 
Such an approach is useful for the subsequent identification 
of enzyme deficiencies present in other types of CDGS that 

35 have not been yet identified. 

CDGS Type II results from defective GlcNAc transferase 
II (GlcNAc-T II) which transfers GlcNAc from UDP- 
GlcNAc to the 2-position of a mannose residue in the 
intermediate branched oligosaccharide (the Core Region) in 

40 the process of building up the disialobiantennary chain 
(Scheme 10) (Schachter, 1986, Brockhausen et al, 1989). 
GlcNAc transferase II is one of the six known enzymes that 
mediate highly regiospecific glycosylation of the mannose 
residues in the Core Region. The Core Region is anchored 

45 at the reducing end to chitobiosylasparagine, where the 
asparagine residue is part of the peptide chain of the gly- 
cosylated protein. The latter structure unit in the substrate 
can be replaced by a hydrophobic chain without loss of 
enzyme activity (Kaur et al, 1991). Thus, the substrate 

50 conjugate for COGS Type II is assembled by linking an 
affinity-labeled linker group to the reducing end of chito- 
biosylasparagine. However, the latter structure unit in the 
substrate can be replaced by a hydrophobic chain without 
loss of enzyme activity (Kaur et al, 1991). For example, 

55 commercially available cx-D-manno- 
pyranosylphenylisothiocyanatecan be coupled to a biotin- 
labeled linker and the 5,6-hydroxyls are selectively pro- 
tected as illustrated in Scheme 11 (Paulsen and 
Meinjohanns, 1992). Coupling of the equatorial 3-OH with 

60 per-O-acetylmannosyl-l-trichloroacetamidate (Paulsen et 
al, 1993) will provide a disaccharide conjugate (Scheme 12). 
If a minor amount of coupling occurs at the axial 2-OH 
group the products can be separated by HPLC. After 
deprotection, the primary 6 -OH is coupled with a second 

65 equivalent of per-O-acetylmannosyl-l-trichloroacetamidate 
to yield the Core Region conjugate. Deprotection of the 
O -acetyl groups yields the substrate conjugate for GlcNAc 
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transferase I which can be converted to the GlcNAc-T II 
substrate by enzymatic glycosyl transfer using a Triton 
X-100 rabbit liver extract, a reaction that has been carried 
out on a preparative scale (Kaur et al. 1981). 

The synthesis of the deuterium labeled derivative needed 5 
for the internal standard is performed in parallel by using a 
labeled PEG-diamine building block (Gerber et al., 1999). 
The biotinylated trisaccharide is converted to the tetrasac- 
charide (product of GlcNAc-T II) by incubation with UDP- 
GlcNAc and transferase II ((Kaur and Hindsgaul, 1991; Tan 10 
et al., 1996) and utilizing the B — S handle for affinity 
purification of the enzymatic products. 
CDGS Type V 

The lipid-linked oligosaccharide (LLO) that is transferred 
to the Asn residue of the glycosylated protein is composed 15 
of 2 GlcNAc, 9 mannoses, and 3 glucoses. It has recently 
been shown that microsomes from CDGS type V patients are 
greatly deficient in the enzyme that transfers one or more 
glucose residues during LLO biosynthesis (Korner et al, 
1998). Since the transferase that attaches the carbohydrate 20 
unit of LLO to the Asn residue discriminates against the 
glucose-deficient LLO, CDGS Type V patients have fewer 
numbers of carbohydrate units attached to glycoproteins, 
such as transferrin (Korner et al., 1998). However, the few 
carbohydrate units that are present are full-length, demon- 25 
strating that residual glucosyl transfer occurs in type V 
CDGS patients (Korner et al., 1998). Thus, quantification of 
the rate of Asn glycosylation by ESI-MS would constitute a 
viable assay of CDGS Type V syndrome. 

Synthetic peptides with 3-7 amino acid residues contain- 30 
ing the Asn-Xaa-Ser/Thr sequence have been shown to be 
good substrates for glycosylation (Ronin et al., 1981). The 
strategy for the ESI-MS assay of the oligosaccharide trans- 
ferase relies on a B — S conjugate of an appropriate peptide 
containing the Asn-Xaa-Ser/Thr sequence (Scheme 13). A 35 
heptapeptide, NH 2 -Tyr-Gln-Ser-Asn-Ser-Thr-Met-NH 2 
(SEQ ID NO:l) has shown high activity in a previous study 
(Ronin et al., 1981). The peptide is readily available by 
standard peptide synthesis using an in-house automatic 
synthesizer. The heptapeptide and its glycoconjugates can be 40 
ionized by ESI to provide stable singly-charged ions. Cou- 
pling of BS-tetrafluorophenyl ester with NH 2 -Tyr-Gln-Ser- 
Asn-Ser-Thr-Met-NH 2 will directly yield the substrate for 
the transferase. Several products are expected from the 
enzymatic glycosylation and subsequent modifications of 45 
the oligosaccharide antenna. The products can be prepared 
enzymatically by incubating thyroid rough microsomes with 
BS-Tyr-Gln-Ser-Asn-Ser-Thr-Met-NH 2 and Dol-P-Glu 
(Ronin et al., 1981a), followed by affinity purification of the 
biotinylated products. Product distribution due to different 50 
degrees of glycosylation can be monitored by ESI-MS, and 
the major components can be purified by HPLC. An analo- 
gous procedure using a B — N(CD 3 )CD 2 CO-conjugate is 
used to prepare deuterated internal standards. 

The molecular masses of the ionized substrate conjugates 55 
for the set of enzymes assayed for CDGS la, lb, II, and V 
syndromes, as well as products and internal standards are 
compiled in Table 5, which shows that no isobaric overlaps 
among the (M+H) + species occur. The close spacing 
between the (M+Na) + ion from the Type la, b product and 60 
the (M+H) + ion of the demannosylated B— (N — C 2 D 5 )-2, 
2-D 2 -Gly-Dol 10 -P internal standard can be readily avoided 
by adjusting the ESI-MS conditions by addition of Na + ions 
to generate the gas phase ions as Na-adducts. 

All three of the targeted enzymes can be analyzed simul- 65 
taneously in a single biological sample, such as a cell lysate. 
The PMM2 and PMIb cannot be assayed simultaneously 



because they require the addition of different exogenous 
substrates. Nevertheless, two assays using identical ESI-MS 
techniques can be used for diagnosing the various CDGS 
types instead of relying on a battery of different methods. 
Clinical Enzymology Assays 

A fibroblast cell pellet is thawed on ice. Sufficient 0.9% 
NaCl is added to give a final protein concentration in the 
lysate of ~5 mg/mL (typically 100 mcL), and the cell pellet 
is sonicated in ice water 5 times for 2 seconds each at 
moderate power. Total protein is determined spectrophoto- 
metrically using the BCA reagent (BCA Protein Assay kit, 
Pierce). 

The total enzyme reaction volume is 20-30 mcL. The 
substrate stock solutions are maintained at concentrations of 
3 mM (SFB) and 2 mM (GM1). These concentrations were 
measured by 1H-NMR signal integration versus an internal 
standard (formamide proton of DMF). Final concentration 
of substrates is 0.3 and 0.2 mM, respectively. A volume of 
reaction buffer (e.g. 200 mM sodium citrate, pH 4.5) equal 
to the difference of the substrate addition (2-3 mcL) plus 
sufficient cell sample volume to equal 50-75 meg total 
protein from 20-30 mcL is added to a 0.5 mL Eppendort 
tube, followed by substrate. The sample is cooled on ice, and 
patient cell sample is added. The reaction is initiated by 
incubation at 37° C. 

For SFB: The reaction is allowed to proceed for 4.5-6 
hours, after which GM1 substrate can be added or the 
reaction can be quenched with 100 mcL of 200 mM glycine- 
carbonate buffer, pH 10.5. 

For GM1: The reaction is allowed to proceed for 0.5 
hours. Quenching is as for SFB. 

After quenching, the samples are placed on ice. Internal 
standards are added (1 nmol each, i.e. 50 mcL of a 0.02 mM 
solution). The samples are microfuged at -15,000 rpm for 2 
min at room temperature to pellet cell debris. Streptavidin- 
Agarose beads (Immunopure immobilized streptavidin, 
Pierce) are placed in a micro bio -spin chromatography 
column (Bio-Rad). Sufficient beads are added to give a total 
biotin binding capacity of 5 nmol (typical binding capacity 
100 pmol per mcL of beads as determined by Pierce). The 
sample supernatant is transferred to the bio-spin tube and 
allowed to bind for 10 minutes at room temperature. The 
sample is spun at -3,000 rpm to remove excess supernatant, 
then washed once with 0.01% Triton X-100 and at least five 
times with purified water, spinning the tube in-between to 
remove solution. For each wash, sufficient wash solution is 
added to fill the bio-spin tube. 

The purified beads are then treated with 30 mcL purified 
water, followed by 10 mcL of a 4 mM biotin solution. The 
tubes are capped at the bottom to prevent leakage and 
allowed to incubate at 2-8° C. for 2-12 hours. The samples 
are spun at -3,000 rpm to elute the sample into a clean 
Eppendorf tube. 

The sample is then diluted with 60 mcL of 50% methanol/ 
water and infused into the ion-trap mass spectrometer. The 
ESI-MS spectrum is tuned to reduce non-specific cleavage 
of the samples by first analyzing a blank sample (cell lysate 
added after reaction quench). The infused sample is ana- 
lyzed by ion chromatogram integration of a 1 amu-wide 
window about the (M+H+) + ions of product and internal 
standard. 

Results are reported in nmol product formed/hour of 
incubation/milligrams total protein in reaction mixtures. 
Clinical Analysis of Patient Samples for GM1 and SFB 

Patient skin fibroblasts were obtained as frozen pellets, 
and stored at -20° C. until use. Two GM1 affected samples 
and six normal controls were analyzed. 
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50 mcL of 0.9% NaCl was added to each patient cell 
pellet. The pellets were lysed by sonication in ice water 5x 
for 2 seconds each at moderate sonication power, chilling the 
microtip in ice water in between sonications. 

Samples were quantitated by BCA (Pierce) assay as 
follows: 

Reagent A and B were mixed in 50:1 ratio as described. 
A protein standard curve was prepared using bovine serum 
albumin as a standard at concentrations of 2, 1, 0.5, 0.2, and 
0.05 mg/mL. A portion of the patient sonicates were diluted 
1:15 in water, and 5 mcL of each diluted patient sample and 
standard curve point was added to separate glass culture 
tubes containing 200 mcL water, in duplicate. The samples 
were then diluted with 1 mL of the mixed BCA reagent, 
vortexed to mix, and incubated at 37° C. for 60 minutes. The 
samples were allowed to cool to room temperature, and 
analyzed against a blank containing only 200 mcL water. 
The samples were analyzed by monitoring absorbance at 
562 nm in polystyrene cuvettes. Average patient absorbance 
values were blank corrected and compared to standards via 
linear regression. 

The patient protein concentrations were determined to be: 

l.(Affected) 12.2 mg/mL, 2. (Normal) 10.8 mg/mL, 3. 
(Normal) 11.9 mg/mL, 4. (Normal) 12.1 mg/mL, 5. 
(Normal) 10.3 mg/mL, 6. (Normal) 7.79 mg/mL, 7. 
(Normal) 15.7 mg/mL, 8. (Affected) 11.4 mg/mL. 

Incubations were performed in a total of 30 mcL of total 
volume. The amount of reaction buffer (200 mM sodium 
citrate, pH 4.25) added to blank Eppendorf tubes was the 
difference of the substrate volume (3 mcL of each substrate 
stock solution, 2 mM for GM1 and 3 mM for SFB, for a total 
of 6 mcL) plus the volume of cell lysate required to equal 75 
meg total protein, from 30 mcL. For example, patient 1. 
incubation mixture initially contained 3 mcL of SFB sub- 
strate solution, 6.14 mcL patient cell lysate, and 17.86 mcL 
reaction buffer. The GM1 substrate was added later in the 
incubation (see below). 

Each patient sample was analyzed in triplicate. The reac- 
tion mixtures were kept on ice during preparation, and the 
reaction was initiated by transfer to a 37° C. water bath. 5.5 
hr into the incubation, 3 mcL GM1 substrate was added to 
each reaction, and after an additional 0.5 hours the reactions 
were placed on ice and quenched with 200 mcL of a 200 mM 
glycine-carbonate buffer, pH 10.25. 

The purification and analysis procedures are as described 
in Clinical Enzymology Assay (Typical). 

The resultant enzyme activities, as an average standard 
deviation nmol product/hour incubation/mg total protein: 
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The following synthetic methods refer to Schemes 14-23. 
Synthesis for GM1 -gangliosidosis (Beta-D-galactosidase 
Deficiency) 

1. 2,3,5,6-Tetrafluorophenyl trifluoroacetate (1) 25 g (0.15 
mol) 2,3,5,6-tetrafluorophenol, 35 mL (0.2 mol) trifluo- 
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roacetic anhydride and 0.5 mL boron trifluoride etherate 
were refluxed for 18 hours under argon atmosphere. 
Trifluo roacetic anhydride and trffluoro acetic acid were 
removed by distillation at room temperature. The trifluo- 
roacetic anhydride fraction was returned to the mixture, 
and the reaction was refluxed for 24 hours. This was 
repeated twice. After final distillation at room 
temperature, the desired product 1 was distilled at reduced 
pressure (62° C./45 mmHg) to produce a colorless liquid 
(30 g, 82%). 1H-NMR. (Gamper, H. B., 1993). 
2. Biotin-2,3,5,6-tetrafluorophenyl ester (2) A 2.5 g (10.3 
mmol) quantity of d-biotin in 20 mL anhydrous DMF 
under argon atmosphere was warmed to 60° C. with 
stirring to effect dissolution. 1.7 mL (12.5 mmol) triethy- 
lamine was added, followed by 3.4 g (12.5 mmol) 1. The 
mixture was stirred for 2 hours, after which the solvent 
was removed by rotary evaporation. The resultant semi- 
solid was triturated with 15 mL ether twice to produce a 
white solid (2.6 g, 65%). 1H-NMR. (Wilbur, D. S., et al., 
1997). 

N-methylglycylbiotinamide -methyl ester (3) A 2.5 g (6.4 
mmol) quantity of biotin tetraffuorophenyl ester in 30 mL 
anhydrous DMF under argon atmosphere was added to a 
mixture of 1.1 g (7.7 mmol) N-methylglycine methyl ester 
hydrochloride dissolved in 10 mL anhydrous DMF and 
1.25 mL (9.0 mmol) triethylamine. The reaction mixture 
was stirred at room temperature for 2 hours, then the 
solvent was removed by rotary evaporation. The residue 
was extracted with chloroform (2x100 mL), washed with 
water (2x20 mL), and dried with anhydrous sodium 
sulfate. The solvent was removed under vacuum to yield 
2.1 g (98%) of methyl ester of N-methylglycine biotina- 
mide as an oil. 1H-NMR. (Wilbur, D. S., etal., 1997). 

4. N-methylglycylbiotinamide acid (4) 
N- Me thy lglycylbiotin amide methyl ester was hydrolyzed 
in a mixture of 31 mL MeOH and 10 mL of IN NaOH at 
room temperature with stirring for 1 hour. The mixture 
was diluted with 50 mL 50% MeOH/water and neutral- 
ized with cation exchange resin, hydrogen form (AG 
MP-50, BioRad). The solution was filtered, the resin 
washed (3x50 mL) with 50% MeOH/water, and the 
solvents removed by rotary evaporation to yield 1.6 g 
(90%) of N-methylglycylbiotinamide acid as an off-white 
solid. 1H-NMR. (Wilbur, D. S., et al., 1997). 

5. p-Acrylamidophenyl-p-D-galactopyranoside (5) 40 mg 
(0.15 mmol) p-aminophenyl p-D-galactopyranoside was 
added to 25 mL methanol and 200 mcL triethylamine with 
stirring. The solution was chilled in an ice bath. 53.3 mg 
(0.6 mmol) acryloyl chloride was dissolved in 5 mL dry 
methylene chloride and added dropwise to the stirred 
solution over 5 minutes. The reaction was allowed to 
return to room temperature, followed by 2 hours of 
stirring. The solution was then treated with successive 
anion and cation exchange resins (AG MP-1 and AG 
MP-50, respectively, BioRad) until neutral pH was 
obtained with moist pH paper. Solvent was removed by 
rotary evaporation to yield a solid (43 mg, 90%). 
1H-NMR. (Romanowska, A., et al., 1994). Michael addi- 
tion product of 4,7,10-trioxa-l,13-tridecanediamine and 5 
(6) 20 mg (0.07 mmol) 5 was added to a stirred solution 
of 80 mg (0.35 mmol) 4,7,10-trioxa-l,13- 
tridecanediamine in 5 mL 0.2M sodium carbonate, pH 
10.5 at 37° C. The reaction was allowed to proceed for 3 
days, after which the solution was neutralized with dilute 
trifluoro acetic acid and purified by reverse -phase HPLC 
(Vydac C-18 prep-scale column, 6 mL/min. Mobile phase: 
H 2 0 (0.08% TFA)/ACN (0.08% TEA)) to give 7.3 mg of 
product. (Romanowska, A., et al., 1994). 
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7. GM1 substrate conjugate of 4 and 6 (7) A 2.5 mg (7.4 
mcmol) quantity of 4 was dissolved in 1 .5 mL anhydrous 
DMF with stirring, under argon atmosphere. 5 mcL tri- 
ethylamine was added, followed by 2.3 mg (8.8 mcmol) 

1. The formation of active ester was monitored by silica 5 
TLC (5:1 CHCI3/CH3OH, Rf 0.5, UV) by briefly drying 
the spotted TLC plate with a stream of air. After 25 
minutes, the mixture was added to 3.2 mg (5.9 mcmol) 6 
in 1 mL anhydrous DMF. After 2 hours, the solvent was 
removed by vacuum centrifugation and the final product 10 
was purified by reverse-phase HPLC (Vydac 0-18 prep- 
scale column, 6 mL/min. Mobile phase: H 2 0 (0.08% 
TFA)/ACN (0.08% TFA)). Yield 4.6 mg. (For analogous 
chemistry, see Wilbur, D. S., et al., 1997). 

8. l,2,10,ll-octadeutero-3,6,9-trioxa-l,ll-undecanedinitrile 15 
(8) 1 g (9.4 mmol) of diethylene glycol was dissolved in 

2 mL D20 in a 10 mL round bottom flask under argon 
atmosphere. The D 2 0 was removed by rotary evaporation 
and the process was repeated 4 times. The d-2 diethylene 
glycol was additioned with 25 mL dry benzene, followed 20 
by 1.6 g (28.2 mmol) d-3 acrylonitrile with stirring under 
argon atmosphere. After 12 h, the solvent was removed 
under reduced pressure and the resultant semisolid was 
extracted with chloroform (2x5 mL). The solvent was 
removed by rotary evaporation to yield 1.85 g (89%) 25 
product. (Ashikaga, K., et al., 1988). 

9. 2,3,ll,12-octadeutero-4,7,10-trioxa-l,13- 
tridecanediamine (9) Raney nickel (Aldrich) was washed 
five times with anhydrous methanol by inversion and 
decantation. 50 mg of the washed catalyst was placed in 30 
20 mL anhydrous methanol, followed by 1 g (4.6 mmol) 

8 in a 50 mL screw-cap vial fitted with a Teflon-lined 
rubber septum. The vial headspace was flushed for a few 
min with H 2 gas via an 18-gauge needle piercing the 
septum. The cap was screwed on tightly and the entire 35 
assembly was charged to 40 psi H 2 and placed in a hot 
water bath (80° C.) for 4 hours, after which the solid 
catalyst was removed by filtration and the methanol 
evaporated. The final product was purified by reverse- 
phase HPLC (Vydac C-18 prep-scale column, 6 mL/min. 40 
Mobile phase: H 2 0 (0.08% TFA)/ACN (0.08% TFA)). 
Yield 180 mg. (Ashikaga, K., et al., 1988). 

10. Deuterated analog of 6 (10) 25 mg (0.09 mmol) 5 was 
added to a stirred solution of 90 mg (0.4 mmol) 9 in 5 mL 
0.2M sodium carbonate, pH 10.5 at 37° C. The reaction 45 
was allowed to proceed for 3 days, after which the 
solution was neutralized with dilute trifluoroacetic acid 
and purified by reverse -phase HPLC (Vydac C-18 prep- 
scale column, 6 mL/min. Mobile phase: H 2 0 (0.08% 
TFA)/ACN (0.08% TFA)). Yield 6 mg. 50 

11. Deuterated analog of 7 (11) A3 mg (8.4 mcmol) quantity 
of 4 was dissolved in 0.7 mL anhydrous DMF with 
stirring, under argon atmosphere. 5 mcL triethylamine 
was added, followed by 2.4 mg (8.9 mcmol) 1. The 
formation of active ester was monitored by silica TLC 55 
(5:1 CHCI3/CH3OH, Rf 0.5, UV) by briefly drying the 
spotted TLC plate with a stream of air. After 25 minutes, 
the mixture was added to 6 mg (11 mcmol) 10 in 1 mL 
anhydrous DMF. After 2 hours, the solvent was removed 
by vacuum centrifugation and the final product was 60 
purified by reverse -phase HPLC (Vydac C-18 prep -scale 
column, 6 mL/min. Mobile phase: H 2 0 (0.08% TFA)/ 
ACN (0.08% TFA)). Yield 1.8 mg. 

12. GM1 internal standard conjugate (12) 1.8 mg 11 was 
added to 2 mL of 100 mM Tris/10 mM MgCl 2 , pH 7.3 65 
buffer with stirring. 15 units recombinant (3-D- 
galactosidase (Sigma) was added, and after 12 hours the 



mixture was purified by reverse-phase HPLC (Vydac 

C-18 prep-scale column, 6 mL/min. Mobile phase: H 2 0 

(0.08% TFA)/ACN (0.08% TFA)). Yield 1.5 mg. 
Polyether Diamine Linker Synthesis (Second Generation) 

Synthesis is based on chemistry previously described 
(Kataky, R. et. al., 1990), with minor modifications and an 
additional two steps. As an example, deviations from the 
established procedure as well as exact details for the addi- 
tional steps are outlined below for the starting material 
diethylene glycol. 

l,ll-Dicyano-3,6,9-trioxaundecane (13) To a stirred solu- 
tion of 2% (w/v) sodium hydroxide (5 mL) and diethylene 
glycol (5.3 g, 50 mmol) was added acrylonitrile (7.95 g, 150 
mmol). The mixture was stirred at room temperature over- 
night and additioned with 50 mL dichloromethane. The 
organic layer was washed 2x with brine and dried (MgS0 4 ). 
The solvent was removed by rotary evaporation. The oily 
residue was treated with 200 proof ethanol, and the solvent 
was removed by rotary evaporation. This was repeated 2x to 
remove excess unre acted acrylonitrile. The product was 
used without further purification 

Diethyl 4,7,10-trioxatridecane-l,13-dioate (14) 2 g (9.4 
mmol) 13, was dissolved in 5 mL ethanol. 1 g cone, sulfuric 
acid was added slowly, over 5 minutes. The reaction was 
heated to reflux overnight. The reaction was extracted with 
40 mL methylene chloride, washed once with 10 mL water 
and 3x with 10 mL dilute brine solution. The organic layer 
was dried (MgS0 4 ) and solvent was removed to yield an oil. 
The final product was purified by silica chromatography 
(methylene chloride/ethyl acetate). 

l,13-dihydroxy-4,7,10-trioxatridecane (15) Prepared 
exactly as described, using tetrahydrofuran as solvent. (1.7 
g, 5.5 mmol 14, 50 mL distilled [CaH 2 ] THF, 0.66 g, 16.5 
mmol lithium aluminum hydride). Once addition was 
complete, excess LAH was quenched with ethanol, and the 
salts precipitated by dropwise addition of saturated sodium 
sulfate solution until a white precipitate formed. The solvent 
was removed, the precipitate washed 6x30 mL with THF 
and the combined organic extracts were evaporated to yield 
an oil. Final product is purified by silica chromatography 
(first with methylene chloride then with ethyl acetate and 
finally with acetone). 

l,13-dichloro-4,7,10-trioxatridecane (analog using P2) 
(16) 1.1 g, (4.9 mmol) 15 was added to 1.15 g (14.6 mmol) 
distilled pyridine in 30 mL dry benzene with stirring, fol- 
lowed by 1.8 g (14.6 mmol) thionyl chloride. The mixture 
was heated to reflux for 6 hours. After cooling in an ice bath, 
5 mL 3M HC1 was added with vigorous stirring. The organic 
layer was separated, washed 3x with a dilute brine solution, 
and dried (NaS0 4 ) to yield a yellowish oil. After washing 
and removal of solvent, the dichloride was used without 
further purification. 

Additional Steps: 

l,13-dicyano-4,7,10-trioxatridecane) (17) To a stirred 
solution of 0.78 g (15.5 mmol) sodium cyanide in 4 mL 
dimethyl sulfoxide at 80° C. was added 1 g (3.9 mmol) of 16. 
After 2 hours, the reaction was additioned with 10 mL of 
saturated sodium chloride solution, 5 mL of water, and 50 
mL ethyl acetate. The organic layer was washed 3x with a 
brine solution as before, after which the organic layer was 
dried (Na 2 S0 4 ) and the solvents removed. The final product 
was purified by silica chromatography (methlyene chloride/ 
ethyl acetate). ESI-MS: predicted, 240.1; observed, 241.1 
(M+H+) + . 

l,15-diamino-5,8,ll-trioxapentadecane (18) A stirred 
solution of 50 mL dry THF containing 0.42 g (10.4 mmol) 
fresh LAH was heated to gentle reflux under argon for 15 
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minutes. 0.5 g (2 mmol) 17 in 15 mL dry THF was added 
dropwise over 20 minutes, maintaining a gentle reflux. The 
unreacted LAH was quenched with ethanol, and the mixture 
was treated with dropwise addition of saturated sodium 
sulfate under efficient stirring until a white precipitate 
formed. The mixture was filtered, and the precipitate was 
washed 6x30 mL with THF. The organic extracts were 
combined and the solvent was removed by rotary evapora- 
tion to yield an oil. ESI-MS: predicted, 248.1; observed, 
249.1 (M+H+) + . 
Deuteration 

Deuterium has been incorporated into the diamine linker 
by reduction of 14 and 17 using lithium aluminum deuteride 
(98% D) to achieve a d-8 deuterated diamine. No other 
aspects of the synthesis were changed for this procedure. 
These diols are used in the construction of the SFD conju- 
gates as described later. 

Clinical Substrate Synthesis for Sanfilippo Syndrome, 
type B (N-a-D-glucosaminidase deficiency). 

13. p-Aminophenyl-a-D-N-acetylglucosamine (19) 20 mg 
(0.07 mmol) p-Nitrophenyl-a-D-N-acetylglucosamine 
(Sigma) was added to 5 mg washed palladium catalyst on 
activated carbon in 3 mL methanol with stirring in a 5 mL 
septa-lined vial. The septum was pierced by a 16-gauge 
needle and the vial headspace was flushed with H 2 gas. H 2 
gas was allowed to slowly bubble through the solution for 
2 hours, after which the catalyst was removed by filtration 
over diatomaceous earth (Celite). The solvent was 
removed by rotary evaporation to yield a semi-solid 18 
mg (90%). 

14. p-Acrylamidophenyl-a-D-N-acetylglucosamine (20) 10 
mg (0.03 mmol) 19 was added to 15 mL methanol and 100 
mcL triethylamine with stirring. The solution was chilled 
in an ice bath. 15 mg (0.17 mmol) acryloyl chloride was 
dissolved in 2 mL dry methylene chloride and added 
dropwise to the stirred solution over 5 minutes. The 
reaction was allowed to return to room temperature, 
followed by 2 hours of stirring. The solution was then 
treated with successive anion and cation exchange resins 
(AG MP-1 and AG MP-50, respectively, BioRad) until 
neutral pH was obtained with moist pH paper. Solvent 
was removed by rotary evaporation to yield a solid (111 
mg, 95%). 1H-NMR. Yield 11 mg. 

15. 3,6-dioxa-l,9-nonanedinitrile (21) 2 g (0.032 mol) eth- 
ylene glycol was added to 0.5 g dry potassium hydroxide 
in 30 mL dry benzene, followed by 5 g (0.096 mmol) 
acrylonitrile with stirring overnight at room temperature. 
The reaction was filtered and the solvent was removed by 
rotary evaporation to yield an oil. Final product was 
purified by silica chromatography (chloroform/methanol) 
to yield a colorless oil 3.2 g (60%). 

16. 4,7-dioxa-l,10-decanediamine (22) Raney nickel 
(Aldrich) was washed five times with anhydrous methanol 
by inversion and decantation. 50 mg of the washed 
catalyst was placed in 20 mL anhydrous methanol, fol- 
lowed by 1 g (6 mmol) 21 in a 50 mL screw-cap vial fitted 
with a Teflon-lined rubber septum. The vial headspace 
was evacuated with H 2 gas via an 16-gauge needle 
piercing the septum. The cap was screwed on tightly and 
the entire assembly was charged to 40 psi H 2 and placed 
in a hot water bath (80° C.) for 4 hours, after which the 
solid catalyst was removed by filtration and the methanol 
evaporated. The final product was purified by reverse - 
phase HPLC (Vydac C-18 prep -scale column, 6 mL/min. 
Mobile phase: H 2 0 (0.08% TFA)/ACN (0.08% TEA)). 

17. Michael addition product of 20 and 22 (23) 5 mg (0.015 
mmol) 20 was added to a stirred solution of 13 mg (0.06 
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mmol) 22 in 5 mL 0.2M sodium carbonate, pH 10.5 at 37° 
C. The reaction was allowed to proceed for 3 days, after 
which the solution was neutralized with dilute trifluoro- 
acetic acid and purified by reverse-phase HPLC (Vydac 
C-18 prep-scale column, 6 mL/min. Mobile phase: H 2 0 
(0.08% TFA)/ACN (0.08% TFA)). Yield 6 mg. 

18. SFB substrate conjugate of 4 and 23 (24) A 4 mg (0.013 
mmol) quantity of 4 was dissolved in 1.5 mL anhydrous 
DMF with stirring, under argon atmosphere. 10 mcL dry 
triethylamine was added, followed by 4 mg (0.015 mmol) 
1. The formation of active ester was monitored by silica 
TLC (5:1 CHCI3/CH3OH, Rf 0.5, UV) by briefly drying 
the spotted TLC plate with a stream of air. After 25 
minutes, the mixture was added to 6 mg (0.012 mmol) 23 
in 1 mL anhydrous DMF. After 2 hours, the solvent was 
removed by vacuum centrifugation and the final product 
was purified by reverse-phase HPLC (Vydac C-18 prep- 
scale column, 6 mL/min. Mobile phase: H 2 0 (0.08% 
TFA)/ACN (0.08% TFA)). Yield 4.2 mg. 

19. l,9-tetradeutero-3,6-dioxa-l,9-nonanedinitrile (25) 0.5 g 
(8 mmol) ethylene glycol was added to 0.1 g dry potas- 
sium hydroxide in 20 mL acetonitrile, followed by 1.4 g 
(24 mmol) d-3 acrylonitrile with stirring overnight at 
room temperature. The reaction was filtered and the 
solvent was removed by rotary evaporation to yield an oil. 
Final product was purified by silica chromatography 
(chloroform/methanol) to yield a colorless oil 0.9 g 
(65%). 

20. l,9-tetradeutero-3,6-dioxa-l,9-nonanediamine (26) 
Raney nickel (Aldrich) was washed five times with anhy- 
drous methanol by inversion and decantation. 20 mg of 
the washed catalyst was placed in 30 mL anhydrous 
methanol, followed by 0.5 g (3 mmol) 25 in a 50 mL 
screw-cap vial fitted with a Teflon-lined rubber septum. 
The vial headspace was evacuated with H 2 gas via an 
18-gauge needle piercing the septum. The cap was 
screwed on tightly and the entire assembly was charged to 
40 psi H 2 and placed in a hot water bath (80° C.) for 4 
hours, after which the solid catalyst was removed by 
filtration and the methanol evaporated. The final product 
was purified by reverse-phase HPLC (Vydac C-18 prep- 
scale column, 6 mL/min. Mobile phase: H 2 0 (0.08% 
TFA)/ACN (0.08% TFA)). 

21. Deuterated analog of 23 (27) 20 mg (0.07 mmol) 
p-acrylamidophenyl p-D-galactoside was added to a 
stirred solution of 90 mg (0.4 mmol) 26 in 5 mL 0.2M 
sodium carbonate, pH 10.5 at 37° C. The reaction was 
allowed to proceed for 3 days, after which the solution 
was neutralized with dilute trifluoroacetic acid and puri- 
fied by reverse -phase HPLC (Vydac C-18 prep -scale 
column, 6 mL/min. Mobile phase: H 2 0 (0.08% TFA)/ 
ACN (0.08% TFA)). Yield 2 mg. 

22. Deuterated analog of 24 (28) A 2 mg (6.3 mcmol) 
quantity of 4 was dissolved in 1.5 mL anhydrous DMF 
with stirring, under argon atmosphere. 5 mcL triethy- 
lamine was added, followed by 2.1 mg (7.6 mcmol) 1. The 
formation of active ester was monitored by silica TLC 
(5:1 CHCI3/CH3OH, Rf 0.5, UV) by briefly drying the 
spotted TLC plate with a stream of air. After 35 minutes, 
the mixture was added to 4 mg (7 mcmol) 27 in 1 mL 
anhydrous DMF. After 2 hours, the solvent was removed 
by vacuum centrifugation and the final product was 
purified by reverse -phase HPLC (Vydac C-18 prep -scale 
column, 6 mL/min. Mobile phase: H 2 0 (0.08% TFA)/ 
ACN (0.08% TFA)). Yield 1.2 mg. 

23. SFB internal standard conjugate (29) 1.2 mg 28 was 
added to 2 mL of 100 mM Tris/10 mM MgCl 2 , pH 7.3 
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buffer with stirring. 15 units recombinant |3-D- 
galactosidase (Sigma) were added, and after 12 hours the 
mixture was purified by reverse-phase HPLC (Vydac 
C-18 prep-scale column, 6 mL/min. Mobile phase: H 2 0 
(0.08% TFA)/ACN (0.08% TEA)). Yield 0.7 mg. 
Clinical Substrate Synthesis for Sanfilippo Syndrome, 
type D (a sulfatase deficiency). 

24. p-Acrylamidophenyl-a-D-N-acetylglucosamine-6- 
sulfate (30) 100 mg (0.28 mmol) 20 was added to 10 mL 
dry DMF under argon atmosphere with stirring at room 
temperature. 89 mg (0.56 mmol) sulfur trioxide-pyridine 
complex was dissolved in 2 mL dry DMF and was added 
to the reaction in 0.7x, l.lx, 1.3x and 1.9x equivalents 
(+700 mcL, +400 mcL, +200 mcL, and +600 mcL). The 
reaction progress was monitored by 1H-NMR shift of the 
anomeric (CI) proton chemical shift from 5.29 to 5.24 
ppm by removal of 15 mcL of solution 1 hour after 
addition of each amount of sulfating reagent. The 
removed mixture was dried by vacuum centrifugation and 
redissolved in d-6 DMSO and analyzed. Upon the appear- 
ance of more than two forms (starting material and C-6 
sulfate) of the CI anomeric proton, the reaction was 
removed to -20° C. and stored. The product was purified 
by vacuum centrifugation to remove solvent, followed by 
reverse-phase HPLC (Vydac C-18 prep-scale column, 6 
mL/min. Mobile phase: H 2 0 (0.08% TFA)/ACN (0.08% 
TFA)). Yield 72%. 

25. Michael addition product of 18 and 30 (31) 25 mg (0.058 
mmol) 30 was added to a stirred solution of 83 mg (0.35 
mmol) 18 in 5 mL 0.2M sodium carbonate, pH 10.5 at 37° 
C. The reaction was allowed to proceed for 3 days, after 
which the solution was neutralized with dilute trifluoro- 
acetic acid and purified by reverse -phase HPLC (Vydac 
C-18 prep-scale column, 6 mL/min. Mobile phase: H 2 0 
(0.08% TFA)/ACN (0.08% TFA)). Yield 10 mg. 

26. SFD substrate conjugate of 4 and 31 (32) A 5.7 mg 
(0.018 mmol) quantity of 4 was dissolved in 1.0 mL 
anhydrous DMF with stirring, under argon atmosphere. 
20 mcL dry triethylamine was added, followed by 5.5 mg 
(0.020 mmol) 1. The formation of active ester was moni- 
tored by silica TLC (5:1 CHCl 3 /CH 3 OH, Rf 0.5, UV) by 
briefly drying the spotted TLC plate with a stream of air. 
After 25 minutes, the mixture was added to 10 mg (0.015 
mmol) 31 in 1 mL anhydrous DMF. After 2 hours, the 
solvent was removed by vacuum centrifugation and the 
final product was purified by reverse-phase HPLC (Vydac 
C-18 prep-scale column, 6 mL/min. Mobile phase 
(0.08% TFA)/ACN (0.08% TFA)). Yield 5.4 mg. 

27. l,2,14,15-octadeutero-l,15-diamino-5,8,ll- 
trioxapentadecane (33) as referenced in Polyether 
Diamine Linker Synthesis, Second Generation. 

28. Deuterated analog of 31 (34) 25 mg (0.07 mmol) 20 was 
added to a stirred solution of 100 mg (0.4 mmol) 11 in 5 
mL 0.2M sodium carbonate, pH 10.5 at 37° C. The 
reaction was allowed to proceed for 3 days, after which 
the solution was neutralized with dilute trifluoro acetic 
acid and purified by reverse-phase HPLC (Vydac C-18 
prep-scale column, 6 mL/min. Mobile phase: H 2 0 (0.08% 
TFA)/ACN (0.08% TFA)). Yield 7 mg. 

29. SFD internal standard conjugate (35) A 4 mg (12.6 
mcmol) quantity of 4 was dissolved in 1 mL anhydrous 
DMF with stirring, under argon atmosphere. 20 mcL 
triethylamine was added, followed by 4 mg (14 mcmol) 1. 
The formation of active ester was monitored by silica 
TLC (5:1 CHCI3/CH3OH, Rf 0.5, UV) by briefly drying 
the spotted TLC plate with a stream of air. After 20 
minutes, the mixture was added to 7 mg (11 mcmol) 34 in 
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1 mL anhydrous DMF. After 4 hours, the solvent was 
removed by vacuum centrifugation and the final product 
was purified by reverse-phase HPLC (Vydac C-18 prep- 
scale column, 6 mL/min. Mobile phase: H 2 0 (0.08% 
TFA)/ACN (0.08% TFA)). Yield 2.7 mg. 
N-(d-Biotinyl-sarcosinyl)-12-aminododecanoic acid (36). 
Compound 4 (32.2 mg, 0.102 mmole) was dried overnight 
in vacuo (with P 2 0 5 ). Dry DMF (2 mL) was added and the 
mixture was stirred with warming to affect dissolution under 
nitrogen. Triethylamine (34 mcL) was added followed by 1 
(20.4 mcL, 0.115 mmole) added in two 10.2 mcL portions, 
5 min apart. The mixture was stirred for 1 hr at room 
temperature under nitrogen. 12-Aminododecanic acid (24.1 
mg, 0.112 mmole, Sigma) was added in one portion, and the 
mixture was stirred at room temperature for 2 hr under 
nitrogen. CHC1 3 (80 mL) was added, and the organic solu- 
tion was washed with two 10 mL portions of 1 M HC1. 
CHCI3 was removed by rotary evaporation, and residual 
DMF was removed by vacuum centrifugation. The com- 
pound was dissolved in methanol and purified by HPLC 
(Vydac 218TP prep column). Solvent program is: 0-10 min, 
water with 0.06% TFA; 10-55 min, 0-100% methanol with 
0.06% TFA, flow rate is 6 mL/min. Yield 31.7 mg. 
1H-NMR. ESI-MS, calculated 513.4, observed 513.4 
(M+H) + . 

N-hydroxysuccinimidyl ester of 36 (37). Compound 36 
(9.8 mg, 19 mcmole) is dissolved in 100 mcL of dry DMF 
under nitrogen. N-hydroxysuccinimide (2.2 mg, 19 mcmole) 
was added followed by dicyclohexylcarbodiimide (3.9 mg, 
19 mcmole). The mixture was stirred at room temperature 
for 60 h in the dark. Solvent was removed by vacuum 
centrifugation, and the residue was submitted to flash chro- 
matography on silica gel using a gradient of CHCl 3 /CH 3 OH 
(15/1) to CHCI3/CH3OH (12/1). Yield 9.8 mg. 1H-NMR. 
ESI-MS, calculated 610.8, observed 609.7 (M+H) + . 

N-(N-(d-Biotinyl-sarcosinyl)-12-aminododecanoyl)- 
pyschosine (38). Compound 37 (6.2 mg, 10 mcmole) and 
pyschosine (4.7 mg, 10 mcmole, Sigma) were dissolved in 
200 mcL of dry DMF under nitrogen. Diisoproylethylamine 
(5 mcL) was added, and the mixture was stirred under 
nitrogen for 2 days in the dark. The compound was injected 
directly onto the HPLC column (Vydac 218TP semi-prep), 
and the column was developed at 2 mL/min with 0-20 min, 
water with 0.06% TFA, then 20-80 min, 0-100% methanol 
with 0.06% TFA. Yield 3.8 mg. 1H-NMR. ESI-MS, calcu- 
lated 957.3, observed 956.8 (M+H)\ 

N-(N-(d-Biotinyl-sarcosinyl)-12-aminododecanoyl)- 
sphingosylphosphorylcholine (39). Sphingosylphosphoryl- 
choline (4.0 mg, Sigma) was mixed with 1 mL dry DMF and 
solvent was removed by vacuum centrifugation. This was 
repeated two more times. The final dried residue weighed 
2.5 mg (5.4 mcmole). To this residue was added 3.3 mg of 
37 (5.4 mcmole), 150 mcL of dry DMF, and 2.5 mcL of 
diisoproylethylamine. The mixture was stirred under nitro- 
gen in the dark for 3 days. The compound was injected 
directly onto the HPLC column (Vydac 218TP semi-prep), 
and the column was developed at 2 mL/min with 0-20 min, 
water with 0.06% TFA, then 20-80 min, 0-100% methanol 
with 0.06% TFA. Yield 3.8 mg. 1H-NMR. ESI-MS, calcu- 
lated 960.3, observed 958.7 (M+H) + . 

Conjugate of d-biotin with l,13-diamino-4,7,10- 
trioxatridecane (40). Compound 2 was reacted with 1,13- 
diamino-4,7,10-trioxatridecane (Fluka) essentially as 
described for the synthesis of 3. The product was purified by 
HPLC (Vydac 218TP, semi-prep) using 0-100% methanol 
with 0.06% TFA over 30 min at 1.5 mL/min. 

Iodoacetylated 40 (41). Compound 40 was treated with 5 
equivalents of iodo acetic anhydride (Aldrich) in dry DMF 
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with stirring under nitrogen for 4 h at room temperature. The 
product was purified on HPLC as for 40. The structure was 
confirmed by ESI-MS. 

Octadeuterated 41 (42). The title compound was prepared 
as for the 40 using 9 instead of l,13-diamino-4,7,10- 5 
trioxatridecane. 

Octadeuterated 42 (43). The title compound was prepared 
from 42 as for 41. The structure was confirmed by ESI-MS. 
Exemplary MS^ Techniques and Instrumentation 

An automated LC-MS/MS system for the identification of 10 
proteins by their amino acid sequence has been developed. 
A schematic representation is shown in FIG. 7. The system, 
which consists of an autosampler, a capillary HPLC system 
connected on-line to an ESI triple quadrupole MS/MS 
instrument and a data system is operated in the following 15 
way: Proteins (typically separated by ID or 2D gel 
electrophoresis) are cleaved with a specific protease, usually 
trypsin, the resulting cleavage fragments are placed in an 
autosampler. Every 37 minutes the autosampler injects one 
sample into the HPLC system and the peptides are separated 20 
by capillary reverse -phase chromatography. As separated 
peptides elute from the chromatography column, they are 
ionized by the ESI process, enter the MS and the mass to 
charge ratio (m/z) is measured. Any peptide ion whose 
intensity exceeds a predetermined intensity threshold is 25 
automatically selected by the instrument and collided in the 
collision cell with inert gas. These collisions result in 
peptide fragmentation, primarily at the bonds of the peptide 
backbone (collision induced dissociation, CID). The masses 
of the CID fragments are measured and recorded in the data 30 
system. The CID spectrum of a peptide contains sufficient 
information to identify the protein by searching sequence 
databases with the uninterpreted MS/MS spectra. This is 
accomplished with the Sequent program. The program iden- 
tifies each peptide in a sequence database which has the 35 
same mass as the peptide that was selected in the MS for 
CID and predicts the MS/MS spectrum for each one of the 
isobanc peptides. By matching the experimentally deter- 
mined CID spectrum with computer generated theoretical 
CID spectra, the protein from which the observed peptide 40 
originated is identified. The system is capable of analyzing 
protein samples in a fully automated fashion at a pace of less 
than 40 mm. per sample. Since each peptide represents an 
independent protein identification and usually multiple pep- 
tides are derived from one protein, protein identification by 45 
this method is redundant and tolerant to proteins 
co-migrating in a gel. The system is well suited for the 
detection and characterization of modified residues within 
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polypeptide chains. The LC-MS/MS technique and auto- 
mated analysis of the generated CID spectra can be used for 
the methods of this invention. 

Identification of Proteins at Sub-femtomole Sensitivity by 
Solid-phase Extraction Capillary Electrophoresis Tandem 
Mass Spectrometry (SPE-CE-MS/MS) 

Protein identification by this method is based on the same 
principle as described above, except that peptide separation 
and ionization are performed at significantly higher sensi- 
tivity. FIG. 8 shows a schematic representation of the key 
design elements. The design of the system and its mode of 
operation have been published. Peptides derived from pro- 
tein digests are concentrated by SPE, separated by CE and 
analyzed by ESI -MS/MS. The resulting uninterpreted CID 
spectra are used to search sequence databases with the 
Sequest software system. The SPE extraction device is a 
small reversed-phase chromatography column of the dimen- 
sions 0.18x1 mm which is directly packed in a fused silica 
separation capillary. Peptides contained in a sample solution 
are adsorbed and concentrated on the SPE device, eluted in 
an estimated 100-300 nl of organic solvent and further 
concentrated by electrophoretic stacking and/or isota- 
chophoresis to an estimated volume of 5-30 nl. The peptides 
are then separated by CE in a 20 /urn or 50 jum i.d. capillary 
and directly ionized by ESI as they leave the capillary (see 
reference 13 for design of the microspray ionization source). 
With this system, peptide masses can be determined at a 
sensitivity of 660 attomoles (approx. 500 fg for a 20 residue 
peptide) at a concentration limit of 33 amol//il and proteins 
can be identified by the CID spectra of automatically 
selected peptides at less than 10 fmol (0.5 ng for a protein 
of 50 kDa) of sample at a concentration limit of less than 300 
amol///l. This technique is used for the analysis at very high 
sensitivity of the peptide samples generated by the experi- 
ments. It has also been demonstrated that the analysis time 
available for automated CID experiments can be signifi- 
cantly extended by data-dependent modulation of the CE 
voltage. If several peptide ions are detected coincidentally in 
the MS, the CE voltage is automatically dropped. This 
results in a reduction of the electroosmotic flow out of the 
capillary and therefore in an extension of the time period 
available for selecting peptide ions for CID. The net effect 
of this peak parking technique is an extension of the 
dynamic range of the technique because the increased time 
available is used for CID of ions with a low ion current. 
Once all the peptide ions are analyzed, electrophoresis is 
automatically reaccelerated by increasing the CE voltage to 
the original value. 



TABLE 1 



Relative, redundant quantitation of a-lactalbumin abundance (after mixing with 
known amount of the same protein with cysteines modified with isotopically heavy 
biotinytlating reagent) 



Peptide m/z Charge Ratio 
# (light) state Peptide Sequence (heavy:light) 



1 518.4 2+ (K) IWCK 2.70 

2 568.4 2+ (K) ALCSEK (SEQ ID NO:2) 2.68 

3 570.4 2+ (K) CEVFR (SEQ ID NO:3) 2.9 

4 760.5 2+ (K) LDQWLCEK (SEQ ID NO:4) 2.82 

5 710.1 3+ (K) FLDDDLTDDIMCVK (SEQ ID NO:5) 2.88 

6 954.2 3+ (K) DDQNP HSSNICNISCDK (SEQ ID 2.9 

NO:6) 
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TABLE 1 -continued 



Relative, redundant quantitation of a-lactalbumin abundance (after mixing with 
known amount of the same protein with cysteines modified with isotopically heavy 
biotinytlating reagent) 

Peptide m/z Charge Ratio 
# (light) state Peptide Sequence (heavy:light) 



7 1286.9 4+ (K) GYGG VS LP EW VCTTFHTSGYDT NA a 
QAIVQNNDSTEYGLFQINNK (SEQ 
ID NO:7) 



a Isotype ratio was not analyzed because on a 4 + peptide the isotype patterns were highly 
overlapping due to differences of only 2 amu between heavy and light ions. 



TABLE 2 

Sequence identification and quantitation of the components of a protein 





micture in a single 


analysis. 












Abserved 




Expected 








ratio 




ratio 


% 


Gene Name* 


Peptide sequence identified 


(dO/d8) T 


Mean ± SD 


(d0/d8r 


error 


LCA_BOVIN 


ALC#SEK (SEQ ID NO:8) 


0.94 










C#EVFR 


1.03 


0.96 ± 0.06 


1.00 


4.2 




FLDDLTDDIMC#VK (SEQ 


0.92 










ID NO:9) 










OVAL_CHICK 


ADHPFLFC#IK (SEQ ID 


1.88 










NO: 10) 




1.92 ± 0.06 


2.00 


4.0 




YPILPEYLQC#VK (SEQ ID 


1.96 










NO:ll) 










BGAL_ECOLI 


LTAAC#FDR (SEQ ID 


1.00 










NO:12) 


0.91 


0.98 ± 0.07 


1.00 


2.0 




IGLNC#QLAQVAER (SEQ 












ID NO:13) 


1.04 










IIFDGVNSAFHLWC#NGR 












(SEQ ID NO:14) 










LACB_BOVIN 


WENGEC#AQK (SEQ ID 


3.64 










NO:15) 




3.55 ± 0.13 


4.00 


11.3 




LSFNPTQLEEQC#HI (SEQ 


3.45 










ID NO:16) 










G3P_RABIT 


VPTPNVSVVDLTC#R 


0.54 










(SEQ ID NO:17) 




0.56 ± 0.02 


0.50 


12.0 




IVSNASC#TTNC#LAPLAK 


0.57 










(SEQ ID NO:18) 










PHS2_RABIT 


IC#GGWQMEEADDWLR 


0.32 










(SEQ ID NO:19) 




0.32 ± 0.03 


0.33 


3.1 




TC#AYTNHTVLPEALER 


0.35 










(SEQ ID NO:20) 












WLVLC#NPGLAEIIAER 


0.30 










(SEQ ID NO:21) 











*Gene names are according to Swiss Prot nomenclature (www.expasy.ch). 
^Ratios were calculated for each peptide as shown in FIG. 3. 

"Expected ratios were calculated from the known amounts of proteins present in each mixture. 
#ICAT-labeled cysteinyl residue. 



TABLE 3 





Protein profiles from veast growing on galacto 


se or ethanol 


as a carbon source. 






Observed 




Gene 




ratio 1 " 


Galactose- Glucose- 


name* 


Peptide sequence identified 


(Eth:Gal)= 


repressed § repressed § 


ACH1 


KHNC#LHEPHMLK (SEQ ID NO:22) 


>100:1 


/ 


ADH1 


YSGVC#HTDLHAWHGDWPLPVK (SEQ 


0.57:1 






ID NO: 23) 


0.48:1 






C#C#SDVFNQVVK (SEQ ID NO:24) 






ADH2 


YSGVC#HTDLHAWHGDWPLPTK (SEQ 


>200:1 


/ / 




ID NO:25) 


>200:1 






C#SSDVFNHVVK (SEQ ID NO:26) 






ALD4 


TFEVINPSTEEEIC#HIYEGR (SEQ ID 


>100:1 


/ / 




NO:27) 






BMH1 


SEHQVELIC#SYR (SEQ ID NO:28) 


0.95:1 




CDC19 


YRPNC#PIILVTR (SEQ ID NO:29) 


0.49:1 
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TABLE 3 -continued 

Protein profiles from yeast growing on galactose or ethanol as a carbon source. 

Observed 



Gene 




ratio 1 " Galactose- 


Glucose- 


name* 


Peptide sequence identified 


mia Gal) = repressed 5 


repressed § 




NC#TP KPTSTTET VAAS AVAAVFEQK 


0.65:1 






(SEQ ID NO:30) 


0.67:1 






AC#DDK 






FBA1 


SIAPAYGIPVVLHSDHC#AK (SEQ ID 


0.60:1 






NO:31) 


0.63:1 






EQVGC#K (SEQ ID NO:32) 






GAL1 


LTGAGWGGC#TVHLVPGGPNGNIEK 


1:>200 


/ 




(SEQ ID NO:33) 






GAL10 


HHIPFYEVDLC#DR (SEQ ID NO:34) 


1:>200 


/ 




DC#VTLK (SEQ ID NO:35) 


1:>200 




GCY1 


LWC#TQHHEPE VALDQS LK (SEQ ID 


0.34:1 


/ 




NO:36) 






GLK1 


IC#SVNLHGDHTFSMEQMK (SEQ ID 


0.65:1 






NO:37) 






GPD1 


IC#SQLK (SEQ ID NO:38) 


0.54:1 


/ 


ICL1 


GGTQC#SIMR (SEQ ID NO:39) 


>100:1 


/ 


IPP1 


NC#FPHHGYIHNYGAFPQTWEDPNVS- 


0.76:1 






HPETK (SEQ ID NO:40) 






LPD1 


VC#HAHPTLSEAFK (SEQ ID NO:41) 


1.30:1 


/ 


PEP4 


KGWTGQYTLDC#NTR (SEQ ID NO:42) 


2.60:1 


/ 


PSA1 


SVVLC#NSTIK (SEQ ID NO:43) 


0.56:1 




PGM2 


C#TGGIILTASHNPGGPENDMGIK (SEQ 


0.58:1 


/ 




ID NO:44) 


0.62:1 






LSIC#GEESFGTGSNHVR (SEQ ID 








NO:45) 






PCK1 


C#PLK 


1.59:1 






TD/^4iT A nCUDF /OCA TT~\ VT f~\ ■ A \ 

IrCtfLAJJorlrK (ockj ID rsDAu) 


1.47:1 








1 52 - l 


/ 

V 




C#AYPIDYIPSAK (SEQ ID NO:48) 


1.41:1 






IVEEPTSKDEIWWGPVNKPC#SER 


1.85:1 






(SEQ ID NO:49) 






QCR6 


ALVHHYEEC#AER (SEQ ID NO:50) 


1.30:1 


/ 


RPL1A 11 


SC#GVDAMSVDDLKK (SEQ ID NO:51) 


0.82:1 




SAH1 


HPEMLEDC#FGLSEETTTGVHHLYR 


0.62:1 






(SEQ ID NO:52) 


0.74:1 






EC#INIKPQVDR (SEQ ID NO:53) 






SOD1 


GFHIHEFGDATNGC#VSAGPHFNPFK 


0.46:1 


/ 




(SEQ ID NO:54) 






TEF1 


RGNVC#GDAK (SEQ ID NO:55) 


0.81:1 






C#GGIDK (SEQ ID NO:56) 


0.70:1 






FVPSKPMC#VEAFSEYPPLGR (SEQ ID 


0.74:1 






NO:57) 






VMA2 


IPIFSASGLPHNEIAAQIC#R (SEQ ID 


0.70:1 






NO:58) 






YHB1 


HYSLC#SASTK (SEQ ID NO:59) 


0.69:1 





*Gene names are according to the Yeast Proteome Database (YPD) (19). 
# Cysteinyl residue is ICAT-labeled. 

^Protein expression ratios were calculated as described in FIG. 3. 

= Carbon source for yeast growth was 2% ethanol (Eth) or 2% galactose (GAL0. 

§ Gene is known to be galactose- or glucose-repressed (19). 

^Eight other ribosomal proteins were detected at similar gene expression levels. 



TABLE 4 



Disease 



Enzyme 



Dysfunction 



Butyrylcholinesterase deficiency BCHE 



Essential fructosuria hepatic 
fmctokinase deficiency 
Hereditary fructose intolerance 

Erythrocyte aldolase deficiency 
with nonspherocytic hemolytic 
anemia (aldolase A deficiency) 
Glycogen storage disease type 
la (von Gierke disease) 
Glycogen storage disease type 
lb" 



Fuctokinase 

Fructose 1,6- 
diphosphate deficiency 
Fructose 1,6- 
biphosphatase aldolase A 

Glucose 6-phosphatase 

Glucose 6-phosphate 
tranlocase 



Decreassed or absent 
enzyme activity 
Deficient enzyme activity 

Deficient enzyme activity 
bisphosphatase 
Deficient enzyme activity 



Absent or deficient 
enzyme activity 
Deficient transport of 
glucose 6-phosphate 
across the membrane of 
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Disease 



Enzyme 



Dysfunction 



Glycogen storage disease type 
III" 

Glycogen storage disease type 

IV (Andersen disease) 
Glycogen storage disease type 

V (McArdle disease) 
Glycogen storage disease X- 
linked phosphorylase kinase 
deficiency 

Glycogen storage disease 
autosomal phosphorylase 
kinase deficiency 
Glycogen storage disease liver 
phosphorylase deficiency 
Glycogen storage disease type 
VII (Tarui disease) 
Liver glycogen synthase 
deficiency 

Phosphoglycerate kinase 
deficiency 

Phosphoglycerate mutase 
deficiency 

Muscle lactate dehydrogenase 
deficiency 



Glucose phosphate isomerase 
deficiency 

Transferase deficiency 
galactosemia 
Galactokinase deficiency 
galactosemia 
Epimerase deficiency 
galactosemia 



Phenylketonuria (PKU) due to 
PAH deficiency 
Hyperphenylalaninemia due to 
DHPR-deficiency 
Hyperphenylalanineemia due to 
GTP-CH-deficiency 
Hyperphenylalaninemia due to 
6-PTS-deflciency 

Oculocutaneous tyrosinemia 
(tyrosinemia type II; tyrosine 
amino-transferase deficiency) 
4-Hydroxyphenylpyruvic acid 
dioxygenase (tyrosinemia type 
III) 

Maleylacetoacetate isomerase 
deficiency (tyrosinemia type lb) 
(tentative) 

Hepatorenal tyrosinemia 
(tyrosinemia type I: 
fumarylacetoacetate hydrolase 
deficiency) 

Carbamyl phosphate synthetase 
deficiency 

Orinthine transcarbamylase 
deficiency 

Arginino succinic acid 
synthetase deficiency 
Arginino succinase deficiency 
Arginase deficiency 
Familial hyperlysinemia 
(variant: saccharopinuria) 
Maple syrup urine disease 
(MSUD) or branched chain 
ketoacidemia 



Amylo-1, 6-glucosidase 
(debrancher enzyme) 
a-1, 4 glucan-6-a- 
glucosyltransferase 
Muscle glycogen 
phosphorylase 
Phosporylase b -kinase 



endoplasmic reticulum 
Absent or deficient 
enzyme activity 
Deficient enzyme activity 

Absent or deficient 
enzyme 

Deficient or absent 
enzyme activity function 



Phosphorylase b-kinase Deficient enzyme activity 



Liver phosphorylase 
Muscle 

phosphofructokinase 1 
Liver glycogen synthase 

Phosphoglycerate kinase 

Phosphoglycerate 
mutase 

Muscle-specific subunit of 
lactate dehydrogenase 
(LDH) 

Glucose phosphate 
isomerase 

Galactose 1-phosphate 

uridyltransferase 

Galactokinase 

Uridine diphosphate 
galactose-4-epimerase 



Phenylalanine 
hydroxylase (PAH) 
Dihydropteridine 
reductase (DHPR) 
Guanosine triphosphate 
cyclohydrolase (GTP-CH) 
6-Pyruvoyl 

tetrahydropterin synthase 

(6-PTS) 

Tyrosine 

aminotransferase 

4-Hydroxy-phenylpyruvic 
acid dioxygenase 

Maleylacetonacetate 
isomerase 

Fumarylaceoacetate 
hydroxylase 



Carbamyl phosphate 
synthetase I 
Orinthine 
transcarbamylase 
Argininosuccinic acid 
synthetase 

Argininosuccinate lyase 
Liver arginase 
a-Aminoadipic 
semialdehyde synthase 
Branched-chain a-keto 
acid dehydrogenase 



Deficient enzyme activity 

Deficient enzyme activity 

Unknown 

Deficient enzyme 

Deficient enzyme 

Absence of M subunit of 
LDH. Muscle LDH is a 
tetramer of the heart- 
specific subunit 
Unknown 

Deficient enzyme activity 

Deficient enzyme activity 

Deficient enzyme action 
in blood cells only 
(benign) or, more rarely, 
in all tissues (generalized) 
Deficient or absent PAH 
activity (<1% normal) 
Deficient or absent DHPR 
activity 

Deficient enzyme activity 
Deficient enzyme activity 

Decreased activity 

Decreased activity 



Presumably decreased 
enzyme activity 

Deficient enzyme activity 



Absent or deficient 
enzyme activity 
Absent or reduced 
enzyme activity 
Deficient enzyme activity 

Deficient enzyme activty 
Deficient enzyme activity 
Deficient enzyme activity 

Deficient or absent (<2%) 
BCKAD complex activity 
in mitochondria; 
immunologic absence or 
reduced levels of enzyme 
subunits; impairment of 
El subunit assembly 
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Disease 



Enzyme 



Dysfunction 



Cystathionine p-synthase 
deficiency 

a-Cystathionase deficiency 
Hepatic methionine 
adenosyltransferase deficiency 
Sarcosinemia 

No nke tonic hyperglycinemia 
Hyperuracil thyminuria 

Dihydr op yr imidinur ia 
Pyridoxine dependency with 
seizures 

GABA aminotransferase 
deficiency 

4-Hydroxybutyric aciduria 

Serum carnosinase deficiency 
and homocarnosinosis 
Alkaptonuria 

Isovaleric acidemia 



Isolated 3-methylcrotonyl-CoA 
carboxylase deficiency 
3-Methylglutanconic aciduria 
Mild form: 

3-Hydroxy-3-methylglutaryl- 
CoA lyase deficiency 
Mevalonic aciduria 
Mitochondrial acetoacetyl-CoA 
thiolase deficiency 

Propionic acidemia (2 nonallelic 
forms designated pccA and 
pccBC) 

Methylmalonic acidemia (2 
allelic variants designated mut° 
and mut" 



Glutaric acidemia type I 
Cytochrome oxidase deficiency 



Cystathionine p- synthase Deficient enzyme activity 



Pyruvate dehydrogenase 
complex deficiency-Ei 
decarboxylase component 
Pyruvate dehydrogenase E 2 
transacylase 

Combined a-ketoacid 
dehydrogenase 
deficiency/lipoamide 
dehydrogenase deficiency 
Pyruvate carboxylase deficiency 



Carnitine palmitoyl transferase I 

(CPT I) deficiency 

Carnitine/ acy lcarnitine 

translocase deficiency 

Carnitine palmitoyl transferase II 

(CPT II) deficiency 

Very long- chain acyl-CoA 

dehydrogenase (VLCAD) 

deficiency 

Long-chain acyl-CoA 
dehydrogenase (LCAD) 
Long-chain L-3-hydroxyacyl- 
CoA dehydrogenase (LCHAD) 
deficiency 

Trifunctional enzyme (TFE) 



a-Cystathionase 

Isoenzyme of methionine 

adenosyltransferase 

Sarcosine 

dehydrogenase? 

Glycine cleavage system 

Dihydropyrimidine 

dehydrogenase 

Dihydropyrimidinase 

Brain glutamic acid 

decarboxylase- 1 

GABA-a-ketoglutarate 

transaminase 

Succinc semialdehyde 

dehydrogenase 

Serum carnosinase 

Homogentisic acid 
oxidase 

Isovaleryl-CoA 
dehydrogenase 

3-Methylcrotonyl-CoA 
carboxylase 

3-Methylglutaconyl-CoA 
hydratase 
3-Hydroxy-3- 
methylglutaryl-CoA lyase 
Mevalonate kinase 
Mitochondrial 
acetoacetyl-CoA thiolase 
(T2) 

Propionyl-CoA 
carboxylase (PCC) 



Methyldmalonyl-CoA 
mutase (MUT) 
apo enzyme 



mut" 

Glutaryl-CoA 
dehydrogenase 
Cytochrome oxidase 
polypeptides 

Pyruvate decaroxylase, 
Ejd 

Dihydrolipo amide 
transacylase 



Lip o amide 
dehydrogenase 



Pyruvate carboxylase 



Carnitine palmitoyl 
transferase I 
Carnitine/acylcarnitine 
translocase 
Carnitine palmitoyl 
transferase II 

Very long-chain acyl-CoA 
dehydrogenase 

Long-chain acyl-CoA 



L-3-hydroxyacyl-CoA 
dehydrogenase 

Trifunctional enzyme Defi- 



Deficient enzyme activity 
Deficient enzyme activity 

Deficient enzyme activity 

Deficient enzyme activity 
Deficient enzyme activity 

Unknown 

Deficient coenzyme 
binding? (brain) 
Deficient enzyme activity 

Deficient enzyme activity 

Deficient enzyme 

Absent or deficient 
enzyme activity 
Deficient enzyme activity, 
deficient protein, 
abnormal peptide size 
Deficient enzyme activity 

Deficient enzyme activity 

Deficient enzyme activity 

Deficient enzyme activity 
Deficient enzyme activity, 
decreased protein, 
unstable protein 
Deficient enzyme activity 
(nonalleic forms reflect 
mutations in nonidentical 
subunits of PCC) 
Absent MUT activity in 
mut°, deficient MUT 
activity due to reduced 
affinity for cofactor 
(adenosylcobalamin) in 

Deficient enzyme activity 

Decreased activity of the 
cytochrome oxidase 
complex 

Decreased enzyme 
activity, decreased 
protein 

Decreased enzyme 
activity; abnormal protein 
electrophoretic mobility 
Decreased enzyme 
activity 



Absent enzyme activity; 7 
cases absent enzyme, 
protein, and mRNA 
Deficient enzyme 

Deficient translocase 

Deficient enzyme 

Deficient enzyme 

Deficient enzyme 
Deficient enzyme 
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Disease 



Enzyme 



Dysfunction 



deficiency 

Dienolyl-Co reductase 
deficiency 

Medium-chain acyl-CoA 
dehydrogenase (MCAD) 
deficiency 

Short-chain acyl-CoA 
dehydrogenase (SCAD) 
deficiency 

Glutaric acudemia type II 



Glycerol kinase deficiency 
(Gkd) 



cient enzyme 

2,4-dienoyl-CoA 

reductase 

Medium-chain acyl-CoA 
dehydrogenase 

Short-chain acyl-CoA 
dehydrogenase 

Electron transfer 
flavoprotein (ETF); 
ETF:ubiquinone 
oxido reductase 
Glycerol kinase 



Primary gout: super active 
variant of 

phosphoribosylpyrophosphate 
(PP-ribose-P) synthetase 
Primary gout: partial deficiency 
of hypoxanthine guanine 
p ho sp ho ribo sy 1 transf eras e 
(HPRT) 

Lesch-Nyhan syndrome: 
deficiency of hypoxanthine 
guanine 

p hosp ho ribo sy 1 transf eras e 
(HPRT) 

2,8-Dihydroxyadenine lithiasis 
(adenine 

p ho sp ho ribo sy 1 transf eras e 
deficiency) 

Adenosine deaminase 
deficiency with severe combined 
immunodeficiency disease 
Purine nucleoside 
phosphorylase deficiency with 
cellular immunodeficiency 
Myoadenylate deaminase 
deficiency 
Xanthinuria 



PP-ribose-P synthetase 



Hypoxanthine guanine 
phosphoribosyl 
transferase (HPRT) 

Hypoxanthine guanine 
phosphoribosyltrans- 
ferase (HPRT) 



Adenine 

phosphoribosyltransferase 
Adenosine deaminase 



Purine nucleoside 
phosphorylase 

Myoadenylate deaminase 
(AMPDI) 

Xanthine dehydrogenase 
(xanthine oxidase) 



Hereditary orotic aciduria 

Pyrimidine 5'-nucleotidase 
deficiency 
Dihydropyrimidine 
dehydrogenase deficiency 
Dihydropyrimidase deficiency 

Familial lipoprotein lipase 
deficiency 



Familial lecithin: cholesterol 
acyltransferase deficiency 
5- Aminolevulinic acid 
dehydratase porphyria 
Acute intermittent porphyria 



Congenital erythropoietic Uroporphyrinogen III 

porphyria cosynthase 

Porphyria cutanea tarda (familial Uroporphyrinogen 

form) decarboxylase 

Hepatoerythropoietic porphyria Uroporphyrinogen 
decarboxylase 

Hereditary coproporphyria 



UMP synthase 

Pyrimidine 5'- 

nucleotidase 

Dihydropyrimidine 

dehydrogenase 

Dihydropyrimidase 

Lipoprotein lipase 



Lecithin:cholesterol 
acyltransferase 
6-Aminolevulinic acid 
dehydratase 
Porphobilinogen 



Variegate porphyria 



Cop r op orphy rino gen 
oxidase 

Pro toporphy rino gen 
oxidase 



Deficient enzyme 
Deficient enzyme 

Deficient enzyme 



In some cases, no 
enzyme antigen; in 
others, no enzyme 
activity 

The micro deletion 
involves not only GK but 
also the other deleted 
loci: AHC, DMD, OTC, 
and other lkinked loci 
Enhanced enzyme 
activity 



Absent or deficient 
enzyme activity 

Absent or deficient 
enzyme activity 



Type I: absent enzyme 
activity; type II: reducted 
affinity for PP-ribose-P 

Absent or greatly 
diminished enzyme 
activity 

Absent or greatly 
diminshed enzyme 
activity 

No enzyme activity; no 
immuno reactive protein 
Type I: absent xanthine 
dehydrogenase activity; 
type II: absent xanthine 
dehydrogenase and 
aldehyde oxidase activity 
Deficient enzyme activity 
(unstable protein) 
Absent or unstable 
enzyme 

Absent or unstable 
enzyme 

Absent or unstable 
enzyme 

Nonfunctional protein in 
some, nondectable 
enzyme activity and 
protein in others 
Absent enzyme protein or 
deficient enzyme activity 
Minimal enzyme activity 

Decreased enzyme 
activity (-50%) 
Minimal enzyme activity 

Decreased enzyme 
activity (-50%) 
Minimal enzyme activity 

Decreased enzyme 
activity (-50%) 
Decreased enzyme 
activity (-50%) 
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TABLE 4-continued 



Disease 



Enzyme 



Dysfunction 



Erythropoietic protophorphyria Ferrochelatase 

Crigler- Najjar syndrome, type I Bilirubin UDP- 

glucuronsyltransferase 
Crigler- Najjar syndrome, type II Billirubin UDP- 

glucuronosyltransferase, 
Gilbert syndrome Bilirubin UDP- 

glucur onosyl transf eras e 
activity 

Phytanic acid a- 
hydroxylase 
Alanine-glyoxylate 
aminotransferase 



Refsum disease 
Primary hyperoxaluria type 1 

Primary hyperoxaluria type 2 

G M2 gangliosidosis: 
hexosamindase a-subunit 
deficiency (variant B, Tay-Sachs 
disease) 

Glycogen storage disease type 
II 

Mucopolysaccharidosis I 
(Hurler, Scheie, and Hurler- 
Sheie syndromes, MPS, MPS 
IS, MPS IH/S) 
Mucopolysaccharidosis II 
(Hunter syndrome) 
Mucopolysaccharidosis III 
(Saniflippo syndrome) types A, 
B, C and D 



Mucopolysaccharidosis IV 
(Morquio syndrome) types A 
and B 

Mucopolysaccharidosis VI 
(Maroteaux-Lamy syndrome) 
Mucopolysaccharidosis VII (Sly 
syndrome) 

I-cell disease (ML-II) 

Schindler disease (a-N-acetyl- 
galactosaminidase deficiency) 
a-Mannosidosis 

p-Mannosidosis 
Sialidosis 

Asp ar tylgluco saminur ia 
Fucosidosis 

Wolman disease and cholestryl 
ester storage disease 
Ceramidase deficiency (Farber 
lipogenulomatosis) 
Nielmann-Pick disease (NPD) 
types A and B (primary 
sphingomyelin storage) 
Gaucher disease type I 
(no nneur o nopathic) 



Globoid-cell leukodystrophy 
(Krabbe disease) 
Metachromatic leukodystrophy 
Fabry disease 



hexosaminidase a-subunit 
deficiency (variant B, Tay-Sachs 
disease) 

Steroid 21 -hydroxylase 
deficiency salt- losing form 



Glyoxylate reductase/D- 
glycerate dehydrogenase 
^-hexosaminidase 



a-glucosidase 
a-L-iduronidase 

Iduronate sulfutase 

IIIA: Heparan N-sulfatase 

IIIB: a-N-acetyl- 

glucosaminidase 

IIC: Acetyl-CoA: a- 

glucosaminidine 

acetyltransferase 

HID: N-acetyl- 

glucosamine-6-sulfatase 

IVA: Galactose 6- 

sulfatase 

IVB: p-Galactosidase 
N-acetyl-galactosamine 
4-sulfatase 
p-glycuronidase 

N-acetylglucosaminyl-1- 

phosphotransferase 

a-N-acetyl- 

galactoaminidase 

a-D-mannosidase 

p-D-mannosidase 
a-neuraminidase 
Aspartyglucosaminidase 
a-L-fucosidase 
Acid lipase 

Ceramidase 

Sphingomyelinase 

Glucoserebrosidase 



Galactosylceramidase 

Arylsulfatase A 
a-Galactosidase A 

Acid p-galactosidase 
(GLBI) 

^-hexosaminidase 



Decrease enzyme activity 
(-50%) 

Absent enzyme activity 

Markedly reduced 
enzyme activity 
Reduced enzyme activity 



Deficient enzyme activity 

Loss of enzyme catalytic 
activity and aberrant 
subcellular distribution 
Loss of enzyme catalytic 
activity 

Absent or defective 
hexosaminidase A (dp) 
activity 

Absent or deficient 
enzyme activity 
Absent enzyme activity 



Absent enzyme activity 
Absent enzyme activity 



Steroid 21-hydroxylase 



Absent enzyme activity 



Absent enzyme activity 

Absent enzyme activity 

Phorphorylation of many 
lysosomal enzymes 
Deficient activity of a-N- 
acetylgalactosaminidase 
Deficient or unstable 
enzyme activity 
Deficient enzyme activity 
Deficient enzyme activity 
Deficient enzyme activity 
Deficient enzyme activity 
Deficient enzyme activity 

Deficient enzyme activity 

Deficient 

sphingomyelinase activity 

Decreased catalytic 
activity and some 
instability of enzyme 
protein 

Absent enzyme activity 

Deficient enzyme activity 
Nonfunctional or unstable 
enzyme protein 
Deficient enzyme activity 

Absent or defective 
hexosaminidase A (ap) 
activity 

Absent or truncated 
enzyme with no activity 
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Disease 



Enzyme 



Dysfunction 



Steroid 5a-reductase 2 



Steroid 5a-reductase 2 
deficiency 

Steroid sulfatase deficiency (X- 3 [3- hydroxy steroid 
linked ichthyosis) 



sulfatase 



Methylenetetrahydrofolate 
reductase deficiency 



Holocarboxylase synthetase 
deficiency 

Biotinidase deficiency 

Hereditary methmoglobinemia 
secondary to cytochrome b 5 
reductase deficiency, types I, II, 
and III 



Pyruvate kinase deficiency 
hemolytic anemia 
Hexokinase deficiency 
hemolytic anemia 
Glucosephosphate isomerase 
deficiency hemolytic anemia 
Aldolase deficiency hemolytic 
anemia 

Trios ephosp hate isomerase 
deficiency hemolytic anemia 
Phosphoglycerate kinase 
deficiency hemolytic anemia 
2,3-Diphospho-glyceromutase 
and phosphatase deficiency 

6-Phosphogluconate 
dehydrogenase deficiency 
Glutathione peroxidase 
deficiency 

Glutathione reductase 
deficiency 

Glutathione synthetase 
deficiency hemolytic anemia 
y-Glutamylcysteine synthetase 
deficiency hemolytic anemia 
Adenosine deaminase 
hyperactivity hemolytic anemia 



Pyrimidine nucleotidase 
deficiency hemolytic anemia 
Myleoperoxidase deficiency 

Carbonic anhydrase II 
deficiency syndrome 
(osteopetrosis with real tubular 
acidosis) 

Albinism, oculocutaneous 
tyrosinase-negative type 
(OCAIA) 
Canavan disease 



Methylenetetrahydro- 
folate reductase 



Holocarboxylase 

synthetase 

Biotinidase 

Cytochrome b s reductase 



Pyruvate kinase 

Heokinase 

Glucosephosphate 
isomerase 
Aldolase (A type) 

Triophosphate 
isomerase 

Phosophoglycerate kinase 

2,3-Diphospho-glycerate- 
mutase and phosphatase 
(1 protein) 
6-Phosphogluconate 
dehydrogenase 
Glutathione perocidase 

Glutathione reductase 

Glutathione synthetase 

y-Glutamylcysteine 
synthetase 

Adenosine deaminase 



Pyrimidine nucleotidase 



Myeloperoxidase 



Carbonic anhydrase II 



Tyrosinase 



Aspartoacylase 



Absent or unstable 
enzyme activity 
Absent immuno reactive 
and enzymatically active 
protein (both deletion and 
nondeletion patients) 
Absent or deficient 
enzyme activity. 
Thermolabile variants 
have been described. 
Deficient holocarboxylase 
synthetase activity 
Deficient biotinidase 
activity 

Deficient enzyme activity 
in erthrocyte cytosol only 
(type I), in all tissues 
(type II), and in all 
hematopoietic cells (type 
III) 

Deficient enzyme activity 

Deficient enzyme activity 

Deficient enzyme activity 

Deficient enzyme activity 

Enzyme activity deficient 
in all tissues 

Deficient enzyme activity 
in hemizygotes 
Deficienct enzyme activity 



Enzyme activity 
deficiency 
Diminshed enzyme 
activity 

Deficient enzyme activity 

Deficient enzyme activity 

Deficient enzyme activity 

Overproduction of 
structurally normal 
enzyme protein mediated 
at mRNA translation level 
Deficient enzyme activity 

Absent or deficient 
enzyme activity 
Quantitative deficiency of 
carbonic anhydrase II 



Absent, reduced, or 
unusual enzyme activity 

Deficient enzyme activity 
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TABLE 5 



Molecular masses of protonated and sodiated substrate- conjugates, 
products, and internal standards for CDGS enzymes. 



m/z 



Substrate 



Product 



Internal 
standard 



Enzyme 


(M + H) + 


(M + Na) + 


(M + H) + 


(M + Na) 2 


(M + H) + 


(M + Na) + 


Type Ia,b 


711 


733 


549 


571 


555 


577 


Manose- 


725 


747 


563 


585 


570 


592 


transferase 














Type II 


1156 


1178 


1343 


1365 


1348 


1370 


Type 


1126 


1148 


2362 a 


2384 a 


2367 a 


2389 a 



a Calculated for the GlcNAc-T II product and internal standard containin a GlcNAc- 
GlcNAc-Mannose-(Mannose-GlcNAc) 2 residue. 



Scheme 1 



u 

X 



HN NH 




BIOTIN 



LINKER (heavy has 8 deuteriums) 



FUNCTIONAL GROUP 



Flowchart illustration of the procedure for quantitative protien profile 
measurements based on stable isotope ratios by mass spectromerty 



Sample protein mixture 



Derivatize cysteines 
with light isotope reagent 



Reference Protein mixture 




Derivatize cysteines 
with heavy isotope reagent 



Analysis and quantitation of derivatized peptides by LC-MS/MS 
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A 



HN NH 



HO — N 




(in) 



HN NH 




Scheme 4 



Linker j 

HM^r\^fJ Substrate Enzyme 
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30 



-continued 



Linker 




Linker / \ 

Q Handle ^/vw/ Substrate 



Enzyme 



Scheme 5 



Structures of Substrate Conjugates 1 and 2, Product 
40 Conjugates 3 and 4, and Internal Standards 5 and 6 



Scheme 5 

Structures of Substrate Conjugates 1 and 2, Product Conjugates 3 and 4, and Internal Standards 5 and 6 



O 

A 



HN NH 



CH 3 O 
I 



F F 




J 



NH 2 



F F 



OH -OH 



1 Ri = 
HO 




• = 0 = CH 2 

n = 1 OH 



2 Ri 



OH 

un 

HO^\^°\ 

Y" 

o 

-Q = CH 2 



n = 0 




Scheme 6 




Glutathione synthetase 




Arginase 



Mandelate racemase 
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OH 



HO 




Galactosyl- 
Cerebroside 



Glucose 



Scheme 8 
ATP Glucose-6-P 



Mannose 



Dol-P-mannose j 



GDP Dol-P 



GDP-mannose 



(CH 2 ) 12 CH 3 



ATP 



GTP 



Fructose- 
6-P 



PM 

lb 



Mannos 
e-6-P 



PM 

M2 



Mannos 
e-l-P 



Dol = Dolichol 




OR 



R = SiMe2 t-Bu 



Scheme 9 



1. (CF 3 COO) 2 Pd 
benzoquinone, 
AcOH 

2. CH 3 ONa/CH 3 OH 




OH 



1. Active ester coupling 

2. OH-deprotection 

3. Phosphorylation 
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B-S-Dol 10 -P X = H or D 
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OH 
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s 



TsOH, Py 
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D D 



R = NH 



D D 



-continued 
D D 



11 




D D / 




NCH 3 



12 



D D 



D D 



HO 




NC V 



C 2 H 5 ^ 



o v o 
x^^^^^^o' 

X= OH, 
X=C1, 
X=CN, 
X=CH 2 NH 2 , 



Scheme 17 
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40 



15 



16 



45 



17 



18 



H 



H 



50 



Scheme 18 



.OH 

NHAC 



X = NH 2 , 



X = NH 
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20 



or 



D D 



19 X = CN, 

60 



X = CH 2 NH 2 , 
X = CN, 
65 X = CH 2 NH 2 , 



D D 
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Schem e 19 

OH 




NH — X 
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SEQUENCE LISTING 



<160> NUMBER OF SEQ ID NOS : 6 4 

<210> SEQ ID NO 1 
<211> LENGTH: 7 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: 

Heptapeptide motif found in substrates for glycosylation 

<4 00> SEQUENCE: 1 

Tyr Gin Ser Asn Ser Thr Met 
1 5 



<210> SEQ ID NO 2 
<211> LENGTH: 7 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: Test 
peptide 

<4 00> SEQUENCE: 2 

Lys Ala Leu Cys Ser Glu Lys 
1 5 



<210> SEQ ID NO 3 
<211> LENGTH: 6 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: Test 
peptide 

<4 00> SEQUENCE: 3 

Lys Cys Glu Val Phe Arg 
1 5 



<210> SEQ ID NO 4 

<211> LENGTH: 9 

<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: Test 
peptide 
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<4 00> SEQUENCE : 4 

Lys Leu Asp Gin Trp Leu Cys Glu Lys 
1 5 



<210> SEQ ID NO 5 
<211> LENGTH: 15 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: Test 
peptide 

<4 00> SEQUENCE: 5 

Lys Phe Leu Asp Asp Asp Leu Thr Asp Asp lie Met Cys Val Lys 
15 10 15 



<210> SEQ ID NO 6 
<211> LENGTH: 18 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: Test 
peptide 

<4 00> SEQUENCE: 6 

Lys Asp Asp Gin Asn Pro His Ser Ser Asn lie Cys Asn lie Ser Cys 
15 10 15 

Asp Lys 



<210> SEQ ID NO 7 
<211> LENGTH: 43 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: Test 
peptide 

<4 00> SEQUENCE: 7 

Lys Gly Tyr Gly Gly Val Ser Leu Pro Glu Trp Val Cys Thr Thr Phe 
15 10 15 

His Thr Ser Gly Tyr Asp Thr Gin Ala lie Val Gin Asn Asn Asp Ser 
20 25 30 

Thr Glu Tyr Gly Leu Phe Gin lie Asn Asn Lys 
35 40 



<210> SEQ ID NO 8 

<211> LENGTH: 6 

<212> TYPE: PRT 

<213> ORGANISM: bovine 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (3) 

<223> OTHER INFORMATION: C at position 3 is ICAT- labeled cysteinyl 
residue 

<4 00> SEQUENCE: 8 

Ala Leu Cys Ser Glu Lys 
1 5 



<210> SEQ ID NO 9 

<211> LENGTH: 13 

<212> TYPE: PRT 

<213> ORGANISM: bovine 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (11) 
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<223> OTHER INFORMATION: C at position 11 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 9 

Phe Leu Asp Asp Leu Thr Asp Asp lie Met Cys Val Lys 
1 5 10 



<210> SEQ ID NO 10 

<211> LENGTH: 10 

<212> TYPE: PRT 

<213> ORGANISM: chicken 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (8) 

<223> OTHER INFORMATION: C at position 8 is ICAT-labeled cystenyl 
residue. 

<4 00> SEQUENCE: 10 

Ala Asp His Pro Phe Leu Phe Cys lie Lys 
1 5 10 



<210> SEQ ID NO 11 

<211> LENGTH: 12 

<212> TYPE: PRT 

<213> ORGANISM: chicken 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (10) 

<223> OTHER INFORMATION: C at position 10 is ICAT labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 11 

Tyr Pro lie Leu Pro Glu Tyr Leu Gin Cys Val Lys 
1 5 10 



<210> SEQ ID NO 12 

<211> LENGTH: 8 

<212> TYPE: PRT 

<213> ORGANISM: E coli 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (5) 

<223> OTHER INFORMATION: C at position 5 is ICAT-labeled cysteinyl 
residue . 

<400> SEQUENCE: 12 

Leu Thr Ala Ala Cys Phe Asp Arg 
1 5 



<210> SEQ ID NO 13 

<211> LENGTH: 13 

<212> TYPE: PRT 

<213> ORGANISM: E coli 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (5) 

<223> OTHER INFORMATION: C at position 5 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 13 

lie Gly Leu Asn Cys Gin Leu Ala Gin Val Ala Glu Arg 
15 10 



<210> SEQ ID NO 14 

<211> LENGTH: 17 

<212> TYPE: PRT 

<213> ORGANISM: E coli 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 
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<222> LOCATION: (14) 

<223> OTHER INFORMATION: C at position 14 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 14 

lie lie Phe Asp Gly Val Asn Ser Ala Phe His Leu Trp Cys Asn Gly 
1 5 10 15 

Arg 



<210> SEQ ID NO 15 

<211> LENGTH: 9 

<212> TYPE: PRT 

<213> ORGANISM: bovine 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (6) 

<223> OTHER INFORMATION: C at position 6 is ICAT-labeled cysteinyl 
residue. 

<400> SEQUENCE: 15 

Trp Glu Asn Gly Glu Cys Ala Gin Lys 
1 5 



<210> SEQ ID NO 16 

<211> LENGTH: 14 

<212> TYPE: PRT 

<213> ORGANISM: bovine 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (12) 

<223> OTHER INFORMATION: C at position 12 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 16 

Leu Ser Phe Asn Pro Thr Gin Leu Glu Glu Gin Cys His lie 
1 5 10 



<210> SEQ ID NO 17 

<211> LENGTH: 14 

<212> TYPE: PRT 

<213> ORGANISM: rabbit 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (13) 

<223> OTHER INFORMATION: C at position 13 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 17 

Val Pro Thr Pro Asn Val Ser Val Val Asp Leu Thr Cys Arg 
1 5 10 



<210> SEQ ID NO 18 

<211> LENGTH: 17 

<212> TYPE: PRT 

<213> ORGANISM: rabbit 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (1)..(17) 

<223> OTHER INFORMATION: C at positions 7 and 11 are ICAT-labeled 
cysteinyl residues. 

<4 00> SEQUENCE: 18 

lie Val Ser Asn Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro Leu Ala 
15 10 15 

Lys 



<210> SEQ ID NO 19 
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<211> LENGTH: 15 

<212> TYPE: PRT 

<213> ORGANISM: rabbit 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (2) 

<223> OTHER INFORMATION: C at position 2 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 19 

lie Cys Gly Gly Trp Gin Met Glu Glu Ala Asp Asp Trp Leu Arg 
15 10 15 



<210> SEQ ID NO 20 

<211> LENGTH: 16 

<212> TYPE: PRT 

<213> ORGANISM: rabbit 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (2) 

<223> OTHER INFORMATION: C at position 2 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 2 0 

Thr Cys Ala Tyr Thr Asn His Thr Val Leu Pro Glu Ala Leu Glu Arg 
15 10 15 



<210> SEQ ID NO 21 

<211> LENGTH: 16 

<212> TYPE: PRT 

<213> ORGANISM: rabbit 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (5) 

<223> OTHER INFORMATION: C at position 5 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 21 

Trp Leu Val Leu Cys Asn Pro Gly Leu Ala Glu lie lie Ala Glu Arg 
15 10 15 



<210> SEQ ID NO 22 

<211> LENGTH: 12 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (4) 

<223> OTHER INFORMATION: C at position 4 is ICAT-labeled cysteinyl 
residue. 

<400> SEQUENCE: 22 

Lys His Asn Cys Leu His Glu Pro His Met Leu Lys 
1 5 10 



<210> SEQ ID NO 23 

<211> LENGTH: 21 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (5) 

<223> OTHER INFORMATION: C at position 5 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 2 3 

Tyr Ser Gly Val Cys His Thr Asp Leu His Ala Trp His Gly Asp Trp 
15 10 15 



Pro Leu Pro Val Lys 
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20 



<210> SEQ ID NO 24 

<211> LENGTH: 11 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (1)..(2) 

<223> OTHER INFORMATION: C at positions 1 and 2 are ICAT-labeled 
cysteinyl residues. 

<4 00> SEQUENCE: 2 4 

Cys Cys Ser Asp Val Phe Asn Gin Val Val Lys 
1 5 10 



<210> SEQ ID NO 25 

<211> LENGTH: 21 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (5) 

<223> OTHER INFORMATION: C at position 5 is ICAT-labeled cysteinyl 
residue. 

<400> SEQUENCE: 25 

Tyr Ser Gly Val Cys His Thr Asp Leu His Ala Trp His Gly Asp Trp 
15 10 15 

Pro Leu Pro Thr Lys 
20 



<210> SEQ ID NO 26 

<211> LENGTH: 11 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (1) 

<223> OTHER INFORMATION: C at position 1 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 2 6 

Cys Ser Ser Asp Val Phe Asn His Val Val Lys 
1 5 10 



<210> SEQ ID NO 27 

<211> LENGTH: 20 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (14) 

<223> OTHER INFORMATION: C at position 14 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 2 7 

Thr Phe Glu Val lie Asn Pro Ser Thr Glu Glu Glu lie Cys His lie 
15 10 15 

Tyr Glu Gly Arg 

20 



<210> SEQ ID NO 28 

<211> LENGTH: 12 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 
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<222> LOCATION: (9) 

<223> OTHER INFORMATION: C at position 9 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE : 2 8 

Ser Glu His Gin Val Glu Leu lie Cys Ser Tyr Arg 
1 5 10 



<210> SEQ ID NO 29 

<211> LENGTH: 12 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (5) 

<223> OTHER INFORMATION: C at position 9 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 2 9 

Tyr Arg Pro Asn Cys Pro lie lie Leu Val Thr Arg 
1 5 10 



<210> SEQ ID NO 30 

<211> LENGTH: 25 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (2) 

<223> OTHER INFORMATION: C at position 2 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 3 0 

Asn Cys Thr Pro Lys Pro Thr Ser Thr Thr Glu Thr Val Ala Ala Ser 
15 10 15 

Ala Val Ala Ala Val Phe Glu Gin Lys 
20 25 



<210> SEQ ID NO 31 

<211> LENGTH: 19 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (17) 

<223> OTHER INFORMATION: C at position 17 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 31 

Ser lie Ala Pro Ala Tyr Gly lie Pro Val Val Leu His Ser Asp His 
15 10 15 

Cys Ala Lys 



<210> SEQ ID NO 32 

<211> LENGTH: 6 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (5) 

<223> OTHER INFORMATION: C at position 5 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 32 



Glu Gin Val Gly Cys Lys 
1 5 
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<210> SEQ ID NO 33 

<211> LENGTH: 24 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (9) 

<223> OTHER INFORMATION: C at position 9 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 3 3 

Leu Thr Gly Ala Gly Trp Gly Gly Cys Thr Val His Leu Val Pro Gly 
15 10 15 

Gly Pro Asn Gly Asn lie Glu Lys 
20 



<210> SEQ ID NO 34 

<211> LENGTH: 13 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (11) 

<223> OTHER INFORMATION: C at position 11 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 3 4 

His His lie Pro Phe Tyr Glu Val Asp Leu Cys Asp Arg 
1 5 10 



<210> SEQ ID NO 35 

<211> LENGTH: 6 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (2) 

<223> OTHER INFORMATION: C at position 2 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 35 

Asp Cys Val Thr Leu Lys 
1 5 



<210> SEQ ID NO 36 

<211> LENGTH: 18 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (3) 

<223> OTHER INFORMATION: C at position 3 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 3 6 

Leu Trp Cys Thr Gin His His Glu Pro Glu Val Ala Leu Asp Gin Ser 
15 10 15 

Leu Lys 



<210> SEQ ID NO 37 

<211> LENGTH: 18 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (2) 

<223> OTHER INFORMATION: C at position 2 is ICAT labeled cysteinyl 
residue. 
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<4 00> SEQUENCE: 3 7 

lie Cys Ser Val Asn Leu His Gly Asp His Thr Phe Ser Met Glu Gin 
15 10 15 

Met Lys 



<210> SEQ ID NO 38 

<211> LENGTH: 6 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (2) 

<223> OTHER INFORMATION: C at position 2 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 3 8 

lie Cys Ser Gin Leu Lys 
1 5 



<210> SEQ ID NO 39 

<211> LENGTH: 9 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (5) 

<223> OTHER INFORMATION: C at position 5 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 3 9 

Gly Gly Thr Gin Cys Ser lie Met Arg 
1 5 



<210> SEQ ID NO 40 

<211> LENGTH: 30 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (2) 

<223> OTHER INFORMATION: C at position 2 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 4 0 

Asn Cys Phe Pro His His Gly Tyr lie His Asn Tyr Gly Ala Phe Pro 
15 10 15 

Gin Thr Trp Glu Asp Pro Asn Val Ser His Pro Glu Thr Lys 
20 25 30 



<210> SEQ ID NO 41 

<211> LENGTH: 13 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (2) 

<223> OTHER INFORMATION: C at position 2 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 41 

Val Cys His Ala His Pro Thr Leu Ser Glu Ala Phe Lys 
1 5 10 



<210> SEQ ID NO 42 

<211> LENGTH: 14 

<212> TYPE: PRT 

<213> ORGANISM: yeast 



US 6,852,544 B2 
97 98 

-continued 



<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (11) 

<223> OTHER INFORMATION: C at position 11 is ICAT-labeled cysteinyl 
residue. 

<400> SEQUENCE: 42 

Lys Gly Trp Thr Gly Gin Tyr Thr Leu Asp Cys Asn Thr Arg 
1 5 10 



<210> SEQ ID NO 43 

<211> LENGTH: 10 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (5) 

<223> OTHER INFORMATION: C at position 5 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 4 3 

Ser Val Val Leu Cys Asn Ser Thr lie Lys 
1 5 10 



<210> SEQ ID NO 44 

<211> LENGTH: 23 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (1) 

<223> OTHER INFORMATION: C at position 1 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 4 4 

Cys Thr Gly Gly lie lie Leu Thr Ala Ser His Asn Pro Gly Gly Pro 
15 10 15 

Glu Asn Asp Met Gly lie Lys 
20 



<210> SEQ ID NO 45 

<211> LENGTH: 17 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (4) 

<223> OTHER INFORMATION: C at position 4 is ICAT-labeled cysteinyl 
residue. 

<400> SEQUENCE: 45 

Leu Ser lie Cys Gly Glu Glu Ser Phe Gly Thr Gly Ser Asn His Val 
15 10 15 

Arg 



<210> SEQ ID NO 46 

<211> LENGTH: 10 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (3) 

<223> OTHER INFORMATION: C at position 3 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 4 6 



lie Pro Cys Leu Ala Asp Ser His Pro Lys 
15 10 
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<210> SEQ ID NO 47 

<211> LENGTH: 17 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (1) 

<223> OTHER INFORMATION: C at position 1 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 4 7 

Cys lie Asn Leu Ser Ala Glu Lys Glu Pro Glu lie Phe Asp Ala lie 
15 10 15 

Lys 



<210> SEQ ID NO 48 

<211> LENGTH: 12 

<212> TYPE: PRT 

<213> ORGANISM: Yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (1) 

<223> OTHER INFORMATION: C at position 1 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 4 8 

Cys Ala Tyr Pro lie Asp Tyr lie Pro Ser Ala Lys 
1 5 10 



<210> SEQ ID NO 49 

<211> LENGTH: 23 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (20) 

<223> OTHER INFORMATION: C at position 20 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 4 9 

lie Val Glu Glu Pro Thr Ser Lys Asp Glu lie Trp Trp Gly Pro Val 
15 10 15 

Asn Lys Pro Cys Ser Glu Arg 
20 



<210> SEQ ID NO 50 

<211> LENGTH: 12 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (9) 

<223> OTHER INFORMATION: C at position 9 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 5 0 

Ala Leu Val His His Tyr Glu Glu Cys Ala Glu Arg 
1 5 10 



<210> SEQ ID NO 51 

<211> LENGTH: 14 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (2) 

<223> OTHER INFORMATION: C at position 2 is ICAT-labeled cysteinyl 
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residue. 
<4 00> SEQUENCE : 51 

Ser Cys Gly Val Asp Ala Met Ser Val Asp Asp Leu Lys Lys 
1 5 10 



<210> SEQ ID NO 52 

<211> LENGTH: 24 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (8) 

<223> OTHER INFORMATION: C at position 8 is ICAT-labeled cysteinyl 
residue. 

<400> SEQUENCE: 52 

His Pro Glu Met Leu Glu Asp Cys Phe Gly Leu Ser Glu Glu Thr Thr 
15 10 15 

Thr Gly Val His His Leu Tyr Arg 
20 



<210> SEQ ID NO 53 

<211> LENGTH: 11 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (2) 

<223> OTHER INFORMATION: C at position 2 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 5 3 

Glu Cys lie Asn lie Lys Pro Gin Val Asp Arg 
1 5 10 



<210> SEQ ID NO 54 

<211> LENGTH: 25 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (14) 

<223> OTHER INFORMATION: C at position 14 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 5 4 

Gly Phe His lie His Glu Phe Gly Asp Ala Thr Asn Gly Cys Val Ser 
15 10 15 

Ala Gly Pro His Phe Asn Pro Phe Lys 
20 25 



<210> SEQ ID NO 55 

<211> LENGTH: 9 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (5) 

<223> OTHER INFORMATION: C at position 5 is ICAT-labeled cysteinyl 
residue. 

<400> SEQUENCE: 55 

Arg Gly Asn Val Cys Gly Asp Ala Lys 
1 5 



<210> SEQ ID NO 5 6 
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<211> LENGTH: 6 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (1) 

<223> OTHER INFORMATION: C at position 1 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 5 6 

Cys Gly Gly lie Asp Lys 
1 5 



<210> SEQ ID NO 57 

<211> LENGTH: 20 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (8) 

<223> OTHER INFORMATION: C at position 8 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 5 7 

Phe Val Pro Ser Lys Pro Met Cys Val Glu Ala Phe Ser Glu Tyr Pro 
15 10 15 

Pro Leu Gly Arg 
20 



<210> SEQ ID NO 58 

<211> LENGTH: 20 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (19) 

<223> OTHER INFORMATION: C at position 19 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 5 8 

lie Pro lie Phe Ser Ala Ser Gly Leu Pro His Asn Glu lie Ala Ala 
15 10 15 

Gin lie Cys Arg 
20 



<210> SEQ ID NO 59 

<211> LENGTH: 10 

<212> TYPE: PRT 

<213> ORGANISM: yeast 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (5) 

<223> OTHER INFORMATION: C at position 5 is ICAT-labeled cysteinyl 
residue. 

<4 00> SEQUENCE: 5 9 

His Tyr Ser Leu Cys Ser Ala Ser Thr Lys 
1 5 10 



<210> SEQ ID NO 60 

<211> LENGTH: 14 

<212> TYPE: PRT 

<213> ORGANISM: rabbit 

<220> FEATURE: 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (13) 

<223> OTHER INFORMATION: C at position 13 is ICAT-labeled cysteinyl 
residue. 
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-continued 



<4 00> SEQUENCE : 6 0 

Val Pro Thr Pro Asn Val Ser Val Val Asp Leu Thr Cys Arg 
1 5 10 



<210> SEQ ID NO 61 
<211> LENGTH: 18 
<212> TYPE: PRT 

<213> ORGANISM: Streptomyces lividans 
<4 00> SEQUENCE: 61 

Leu Gly Lys Pro Val Leu Thr Ala Asn Gin Val Thr lie Trp Glu Gly 
15 10 15 

Leu Arg 



<210> SEQ ID NO 62 

<211> LENGTH: 19 

<212> TYPE: PRT 

<213> ORGANISM: Unknown 

<220> FEATURE: 

<223> OTHER INFORMATION: Description of Unknown Organism: Unidentifed 



<4 00> SEQUENCE: 62 

lie Ala Asn Pro Asn Val Tyr Thr Glu Thr Leu Thr Ala Ala Thr Val 
15 10 15 



Cys Thr lie 



<210> SEQ ID NO 63 
<211> LENGTH: 19 
<212> TYPE: PRT 
<213> ORGANISM: Unknown 
<220> FEATURE: 

<223> OTHER INFORMATION: Description of Unknown Organism: Unidentifed 
<4 00> SEQUENCE: 6 3 

Leu Ala Leu Leu Pro Ser Asp Ala Glu Gly Pro His Gly Gin Phe Val 
15 10 15 

Thr Asp Lys 



<210> SEQ ID NO 64 

<211> LENGTH: 20 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapiens 

<4 00> SEQUENCE: 6 4 

Ala Leu Leu Val Leu Val Ala Pro Ala Met Ala Ala Gly Asn Gly Glu 
15 10 15 

Asp Leu Arg Asn 
20 



What is claimed is: 55 

1. A reagent for mass spectrometric analysis of proteins 
which has the general formula: 

A-L-PRG 

where A is an affinity label that selectively binds to a 60 
capture reagent, L is a linker group which is differen- 
tially labeled with stable isotopes and PRG is a protein 
reactive group that selectively reacts with protein func- 
tional groups. 

2. The reagent of claim 1 wherein PRG is a sulfhydryl 
reactive group or an amine reactive group. 65 

3. The reagent of claim 1 wherein PRG is an enzyme 
substrate. 



4. The reagent of claim 1 wherein the A-L-PRG is soluble 
in a sample liquid to be analyzed. 

5. The reagent of claim 1 wherein the linker is a cleavable 
linker. 

6. The reagent of claim 1 which has the general formula: 

A-B 1 — X 1 — (CH 2 )„ — [X 2 — (CH^mL — X 3 — (CH 2 ) P — X 4 — B 2 — 
PRG 

where: A is the affinity label; 
PRG is the protein reactive group; and 
B 1 -X 1 -(CH 2 )„-[X 2 -(CH 2 ) ro L-X 3 -(CH 2 ) p - 
X 4 — B 2 is the linker group wherein: 
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X 1 , X 2 , X 3 and X 4 , independently of one another, and X 2 nyl ester group, an amine reactive N-hydroxy succinimide 

independently of other X 2 , can be selected from the ester group, sulfonyl halide, isocyanate, isothiocyanante, 

group consisting of O, S, NH, NR, NRR + , CO, active ester, tertafluorophenyl ester, an acid halide, and an 

COO, COS, S— S, SO, S0 2 , CO— NR', CS— NR', Si— O, acid anyhydride; a homoserine lactone reactive primary 

and aryl or diaryl groups or X^X 4 may be absent; 5 amine group, and a carboxylic acid reactive amine, alcohols 

B 1 and B 2 , independently of one another, are optional or 2,3,5, 6-tetrafluorophenyl trifluoroacetate. 

groups selected from COO, CO, CO— NR', CS— NR', 17. The reagent of claim 1 wherein PRG is a substrate for 

(CH 2 ) q — CONR', (CH 2 )^ — CS — NR', or (CH 2 ) q ; an enzyme an enzymatic deficiency of which is linked to a 

n, m, p, q and x are whole numbers that can take values disease state. 

from 0 to about 100, where the sum of n+xm+p+q is 18. The reagent of claim 1 wherein PRG is a substrate for 

less than about 100; an enzyme which is associated with a birth defect. 

R is an alkyl, alkenyl, alkynyl, alkoxy or an aryl group 19 The reagent of claim 1 wherein PRG is a substrate for 

that is optionally substituted with one or more alkyl, an enzyme which is associated with a lysosomal storage 

alkenyl, alkynyl, or alkoxy groups; and 15 disease. 

R is a hydrogen, an alkyl, alkenyl, alkynyl, alkoxy or an 20 The rea § ent of claim 1 wherein PRG is a substrate for 

aryl group that is optionally substituted with one or acid sphingomyelinase, galactocerebroside (3-galactosidase, 

more alkyl, alkenyl, alkynyl, or alkoxy groups (3-galactosidase, acetyl-a-D-glucosaminidase, heparan 

wherein one or more of the CH 2 groups in the linker can sulfamidase, acetyl-Co A-a-D -glucosaminide 

be optionally substituted with alkyl, alkenyl, alkoxy 20 N-acetyltransferase or N-acetylglucosamine-6-sulfatase. 

groups, an aryl group that is optionally substituted with 21 Tne reagent of claim 1 wherein the linker contains a 

one or more alkyl, alkenyl, alkynyl, or alkoxy groups, disulfide group. 

an acidic group, a basic group or a group carrying a 22. The reagent of claim 1 wherein any atom of the linker 

permanent positive or negative charge; wherein one or may be substituted with a heavy isotope. 

more single bonds linking non-adjacent CH 2 groups in 25 23. A reagent kit for the analysis of proteins by mass 

the linker can be replaced with a double or a triple bond spectral analysis that comprises a reagent of claim 1. 

and wherein one or more of the atoms in the linker can 24. The reagent kit of claim 23 that comprises one or more 

be substituted with a stable isotope. reagents of claim 1. 

7. The reagent of claim 6 wherein at least one of B 1 or B 2 25. The reagent kit of claim 23 further comprising one or 
is CO-NR' or CS-NR. 30 more proteolytic enzymes for use in digestion of affinity 

8. The reagent of claim 6 wherein X 1 and X 4 are selected tagged proteins. 

from NH, NR, and NRR ,+ , X 3 is O and all X 2 groups are O. 26. The reagent kit of claim 23 which comprises a set of 

9. A reagent kit for the analysis of proteins by mass substantially chemically identical differentially labeled 
spectral analysis that comprises a reagent of claim 6. affinity tagged reagents. 

10. The reagent of claim 1 wherein the affinity label is 35 27. The reagent kit of claim 23 wherein the reagent is an 
biotin or a modified biotin. affinity tagged enzyme substrate reagent. 

11. The reagent of claim 1 wherein the affinity label is 28. The reagent kit of claim 27 which comprises a set of 
selected from the group consisting of a 1,2-diol, glutathione, substantially chemically identical differentially labeled 
maltose, a nitrilotriacetic acid group, and an oligohistidine. affinity tagged enzyme substrates. 

12. The reagent of claim 1 wherein the affinity label is a 40 29. The reagent kit of claim 28 further comprising a set of 
hapten. substantially chemically identical differentially labeled 

13. The reagent of claim 1 wherein PRG is a sulfhydryl- affinity tagged enzyme products. 

reactive group. 30. The reagent kit of claim 23 wherein the PGR of the 

14. The reagent of claim 1 wherein PRG is an lodoacety- reagent is an enzyme substrate. 

lamide group, an epoxide, an a-haloacyl group, a nitriles, a 45 31. The reagent kit of claim 23 wherein the PRG of the 

sulfonated alkyl, an aryl thiols or a maleimide. reagent is a sulfhydryl reactive group or an amine reactive 

15. The reagent of claim 1 wherein PRG is an amine group. 

reactive group, a group that reacts with a homoserine lactone 32. The reagent kit of claim 23 wherein the linker of the 

or a group that reacts with carboxylic acid group. reagent is a cleavable linker. 

16. The reagent of claim 1 wherein PRG is selected from 50 

the groups consisting of an amine reactive pentafluorophe- * * * * * 



